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(57) Abstract: The invention provides in silico models for determining the physiological function of human cells, including human 
skeletal muscle cells. The models include a data structure relating a plurality of Homo sapiens reactions, a constraint set for the 
plurality of Homo sapiens reactions, and commands for determining a distribution of flux through the reactions that is predictive 
of a Homo sapiens physiological function. A model of the invention can further include a gene database containing information 
characterizing the associated gene or genes. A regulated Homo sapiens reaction can be represented in a mode] of the invention by 
including a variable constraint for the regulated reaction. The invention further provides methods for making an in silico Homo 
sapiens physiological function using a model of the invention. 
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HUMAN METABOLIC MODELS AND METHODS 
BACKGROUND OF THE INVENTION 

This invention relates generally to analysis 
of the activity of chemical reaction networks and, more 
5 specifically, to computational methods for simulating 
and predicting the activity of Homo sapiens reaction 
networks. 

Therapeutic agents, including drugs and 
gene-based agents, are being rapidly developed by the 

10 pharmaceutical industry with the goal of preventing or 
treating human disease. Dietary supplements, including 
herbal products, vitamins and amino acids, are also 
being developed and marketed by the nutraceutical 
industry. Because. of the complexity of the biochemical 

15 reaction networks in and between human cells, even 

relatively minor perturbations caused by a therapeutic 
agent or a dietary component in the abundance or 
activity of a particular target , such as a metabolite, 
gene or protein, can affect hundreds of biochemical 

20 * reactions. These perturbations can lead to desirable 
therapeutic effects, such as cell stasis or cell death 
in the case of cancer cells or other pathologically 
hyperproliferative cells. However, these perturbations 
can also lead to undesirable side effects, such as 

25 production of toxic byproducts, if the systemic effects 
of the perturbations are not taken into account. 

Current approaches to drug and nutraceutical 
development do not take into account the effect of a 
perturbation in a molecular target on systemic cellular 
30 behavior. In order to design effective methods of 
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repairing, engineering or disabling cellular 
activities , it is essential to understand human 
cellular behavior from an integrated perspective. 

Cellular metabolism, which is an example of a 
5 process involving a highly integrated network of 
biochemical reactions, is fundamental to all normal 
cellular or physiological processes, including 
homeostatis, proliferation, differentiation, programmed 
cell death (apoptosis) and motility. Alterations in 

10 cellular metabolism characterize a vast number of human 
diseases. For example, tissue injury is often 
characterized by increased catabolism of glucose, fatty 
acids and amino acids, which, if persistent, can lead 
to organ dysfunction. Conditions of low oxygen supply 

15 (hypoxia) and nutrient supply, such as occur in solid 
tumors, result in a myriad of adaptive metabolic 
changes including activation of glycolysis and 
neovascularization. Metabolic dysfunctions also 
contribute to neurodegenerative diseases, 

20 cardiovascular disease, neuromuscular diseases, obesity 
and diabetes. Currently, despite the importance of 
cellular metabolism to normal and pathological 
processes, a detailed systemic understanding of 
cellular metabolism in human cells is currently 

25 lacking. 

Thus, there exists a need for models that 
describe Homo -sapiens reaction networks, including core 
metabolic reaction networks and metabolic reaction 
networks in specialized cell types, which can be used 
30 to simulate different aspects of human cellular 
behavior under physiological, pathological and 
* therapeutic conditions. The present invention 
satisfies this need, and provides related advantages as 
well. 
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SUMMARY OF THE INVENTION 

The invention provides a computer readable 
medium or media, including: (a) a data structure 
relating a plurality of Homo sapiens reactants to a 
5 plurality of Homo sapiens reactions, wherein each of 
the Homo sapiens reactions includes a reactant 
identified as a substrate of the reaction, a reactant 
identified as a product of the reaction and a 
stoichiometric coefficient relating the substrate and 

10 the product, (b) a constraint set for the plurality of 
Homo sapiens reactions, and (c) commands for 
determining at least one flux distribution that 
minimizes or maximizes an objective function when the 
constraint set is applied to the data representation, 

15 wherein the at least one flux distribution is 

predictive of a Homo sapiens physiological function. 
In one embodiment, at least one of the Homo sapiens 
reactions in the data structure is annotated to 
indicate an associated gene and the computer readable 

20 medium or media further includes a gene database 

including information characterizing the associated 
gene. In another embodiment, at least one of the Homo 
sapiens reactions is a regulated reaction and the 
computer readable medium or media further includes a 

25 constraint set for the plurality of Homo sapiens 
reactions, wherein the constraint set includes a 
variable constraint for the regulated reaction. 

The invention provides a method for 
predicting a Homo sapiens physiological function, 
30 including: (a) providing a data structure relating a 
plurality of Homo sapiens reactants to a plurality of 
Homo sapiens reactions, wherein each of the Homo 
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sapiens reactions includes a reactant identified as a 
substrate of the reaction, a reactant identified as a 
product of the reaction and a stoichiometric 
coefficient relating the substrate and the product; (b) 
5 providing a constraint set for the plurality of Homo 
sapiens reactions; (c) providing an objective 
function, and (d) determining at least one flux 
distribution that minimizes or maximizes the objective 
function when the constraint set is applied to the data 

10 structure , thereby predicting a Homo sapiens 

physiological function. In one embodiment, at least 
one of the Homo sapiens reactions in the data structure 
is annotated to indicate an associated gene and the 
method predicts a Homo sapiens physiological function 

15 related to the gene. 

The invention provides a method for 
predicting a Homo sapiens physiological function, 
including: (a) providing a data structure relating a 
plurality of Homo sapiens reactants to a plurality of 

20 Homo sapiens reactions, wherein each of the Homo 

sapiens reactions includes a reactant identified as a 
substrate of the reaction, a reactant identified as a 
product of the reaction and a stoichiometric 
coefficient relating the substrate and the product, " 

25 wherein at least one of the Homo sapiens reactions is a 
regulated reaction; (b) providing a constraint set for 
the plurality of Homo sapiens reactions, wherein the 
constraint set includes a variable constraint for the 
regulated reaction; (c) providing a condition-dependent 

30 value to the variable constraint; (d) providing an 
objective function, and (e) determining at least one 
flux distribution that minimizes or maximizes the 
objective function when the constraint set is applied 
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to the data structure, thereby predicting a Homo 
sapiens physiological function. 

Also provided by the invention is a method 
for making a data structure relating a plurality of 
5 Homo sapiens reactants to a plurality of Homo sapiens 
reactions in a computer readable medium or media, 
including: (a) identifying a plurality of Homo sapiens 
reactions and a plurality of Homo sapiens reactants 
that are substrates and products of the Homo sapiens 

10 reactions; (b) relating the plurality of Homo sapiens 
reactants to the plurality of Homo. sapiens reactions in 
a data structure, wherein each of the Homo sapiens 
reactions includes a reactant identified as a substrate 
of the reaction, a reactant identified as a product of 

15 the reaction and a stoichiometric coefficient relating 
the substrate and the product; (c) determining a 
constraint set. for the plurality of Homo sapiens 
reactions; (d) providing an objective function; (e) 
determining at least one flux distribution that 

20 minimizes or maximizes the objective function when the 
constraint set is applied to the data structure, and 
(f) if the at least one flux distribution is not 
predictive of a Homo sapiens physiological function, 
then adding a reaction to or deleting a reaction from 

25 the data structure and repeating step (e) , if the at 
least one flux distribution is predictive of a Homo 
sapiens physiological function, then storing the data 
structure in a computer readable medium or media. The 
invention further provides a data structure relating a 

30 plurality of Homo sapiens reactants to a plurality of 
Homo sapiens reactions, wherein the data structure is 
produced by the method. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic representation of 
a hypothetical metabolic network. 

Figure 2 shows mass balance constraints and 
5 flux constraints (reversibility constraints) that can' 
be placed on the hypothetical metabolic network shown 
in Figure 1. 

Figure 3 shows the stoichiometric matrix (S) - 
for the hypothetical metabolic network shown in Figure 
10 1. 

Figure 4 shows , in Panel A, an exemplary 
biochemical reaction network and in Panel B, an 
exemplary regulatory control structure for the reaction 
network in panel A. 



15 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides in silico 
models that describe the interconnections between genes 
in the Homo sapiens genome and their associated 
reactions and reactants. The models can be used to 

20 simulate different aspects of the cellular behavior of 
human cells under different normal f pathological and 
therapeutic conditions, thereby providing valuable 
information for therapeutic, diagnostic and research 
applications. An advantage of the models of the 

25 invention is that they provide a holistic approach to 
simulating and predicting the activity of Homo sapiens 
cells. The models and methods can also be extended to 
simulate the activity of multiple interacting cells, 
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including organs, physiological systems and whole body 
metabolism. 

As an example, the Homo sapiens metabolic 
models of the invention can be used to determine the 
5 effects of changes from aerobic to anaerobic 

conditions, such as occurs in skeletal muscles during 
exercise or in tumors, or to determine the effect of 
various dietary changes. The Homo sapiens metabolic 
models can also be used to determine the consequences 
10 of genetic defects, such as deficiencies in metabolic 
enzymes such as phosphof ructokinase, phosphoglycerate 
kinase, phosphoglycerate mutase, lactate dehydrogenase 
and adenosine deaminase. 

The Homo sapiens metabolic models can also be 
15 used to choose appropriate targets for drug design. 
Such targets include genes, proteins or reactants, 
which when modulated positively or negatively in a 
simulation produce a desired therapeutic result. The 
models and methods of the invention can also be used to 
20 predict the effects of a therapeutic agent or dietary 
supplement on a cellular function of interest. 
Likewise, the models and methods can be used to predict 
both desirable and undesirable side effects of the 
therapeutic agent on an interrelated cellular function 
25 in the target cell, as well as the desirable and 

undesirable effects that may occur in other cell types. 
Thus, the models and methods of the invention can make 
the drug development process more rapid and cost 
effective than is currently possible. 

30 . The Homo sapiens metabolic models can also be 

used to predict or validate the assignment of 
particular biochemical reactions to the enzyme-encoding 
genes found in the genome, and to identify the presence 
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of reactions or pathways not indicated by current 
genomic data. Thus, the models can be used to guide 
the research and discovery process, potentially leading 
to the identification of new enzymes, medicines or 
5 metabolites of clinical importance. 

The models of the invention are based on a 
data structure relating a plurality of Homo sapiens 
reactants to a plurality of Homo sapiens reactions, 
wherein each of the Homo sapiens reactions includes a 

10 reactant identified as a substrate of the reaction, a 
reactant identified as a product of the reaction and a 
stoichiometric coefficient relating the .substrate and 
the product. The reactions included in the data 
structure can be those that are common to all or most 

15 Homo sapiens cells, such as core metabolic reactions, 
or reactions specific for one or more given cell type. 

As used herein, the term "Homo sapiens 
reaction'' is intended to mean a conversion that 
consumes a substrate or forms a product that occurs in 
or by a Homo sapiens cell. The term can include a 
conversion that occurs due to the activity of one or 
more enzymes that are genetically encoded by a Homo 
sapiens genome. The term can also include a conversion 
that occurs spontaneously in a Homo sapiens cell. 
Conversions included in the term include, for example, 
changes in chemical composition such as those due to 
nucleophilic or electrophilic addition, nucleophilic or 
electrophilic substitution, elimination, isomerization, 
deamination, phosphorylation, methylation, reduction, 
oxidation or changes in location such as those that 
occur due to a transport reaction that moves a reactant 
from one cellular compartment to another. in the case 
of a transport reaction, the substrate and product of 
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the reaction can be chemically the same and the 
substrate and product can be differentiated according 
to location in a particular cellular compartment. 
Thus, a reaction that transports a chemically unchanged 
reactant from a first compartment to a second 
compartment has as its substrate the reactant in the 
first compartment and as its. product the reactant in 
the second compartment. It will be understood that 
when used in reference to an in siliao model or data 
structure, a reaction is intended to be a 
representation of a chemical conversion that consumes a 
substrate or produces a product. 

As used herein, the term "Homo sapiens 
reactant" is intended to mean a chemical that is a 
substrate or a product of a reaction that occurs in or 
by a Homo sapiens cell. The term can include 
substrates or products of reactions performed by one or 
more enzymes encoded by a Homo sapiens genome, 
reactions occurring in Homo sapiens that are performed 
by one or more non-genetically encoded macromolecule, 
protein or enzyme, or reactions that occur 
spontaneously in a Homo sapiens cell. Metabolites are 
understood to be reactants within the meaning of the 
term. It will be understood that when used in 
reference to an in silico model or data structure, a 
reactant is intended to be a representation of a 
chemical that is a substrate or a product of a reaction 
that occurs in or by a Homo sapiens cell. 

As used herein the term xx substrate" is 
intended to mean a reactant that can be converted to 
one or more products by a reaction. The term can 
include, for example,, a reactant that is to be 
chemically changed due to nucleophilic or electrophilic 
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addition, nucleophilic or electrophilic substitution, 
elimination, isomerization, deamination, 
phosphorylation, methylation, reduction, oxidation or 
that is to change location such as by being transported 
5 across a membrane or to a different compartment. 



As used herein, the term "product" is 
intended to mean a reactant that results from a 
reaction with one or more substrates. The term can 
include, for example, a reactant that has been 

10 chemically changed due to nucleophilic or electrophilic 
addition, nucleophilic or electrophilic substitution, 
elimination, isomerization, deamination, 
phosphorylation, methylation, reduction or oxidation or 
that has changed location such as by being transported 

15 across a membrane or to a different compartment. 

As used herein, the term "stoichiometric 
coefficient" is intended to mean a numerical constant 
correlating the number of one or more reactants and the 
number of one or more products in a chemical reaction. 

20 Typically, the numbers are integers as they denote the 
number of molecules of each reactant in an elementally 
balanced chemical equation that describes the 
corresponding conversion. However, in some cases the 
numbers can take on non- integer values, for example, 

25 when used in a lumped reaction or to reflect empirical 
data. 



As used herein, the term "plurality, " when 
used in reference to Homo sapiens reactions or 
reactants, is intended to mean at least 2 reactions or 
30 reactants. The term can include any number of Homo 
sapiens reactions or reactants in the range from 2 to 
the number of naturally occurring reactants or 
reactions for a particular of Homo sapiens cell. Thus, 
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the term can include, for example, at least 10, 20, 30, 
50, 100, 150, 200, 300, 400, 500, 600 or more reactions 
or reactants. The number of reactions or reactants can 
be expressed as a portion of the total number of 
5 naturally occurring reactions for a particular Homo 

sapiens cell, such as at least 20%, 30%, 50%, 60%, 75%, 
90%, 95% or 98% of the total number of naturally 
occurring reactions that occur in a particular Homo 
sapiens cell. 

10 As used herein, the term "data structure" is 

intended to mean a physical or logical relationship 
among data elements, designed to support specific data 
manipulation functions. The term can include, for 
example, a list of data elements that can be added 

15 combined or otherwise manipulated such as a list of 

representations for reactions from which reactants can 
be related in a matrix or network. The term can also 
include a matrix that correlates data elements from two 
or more lists of information such as a matrix that 

20 correlates reactants to reactions. Information 

included in the term can represent, for example, a 
substrate, or product of a chemical reaction, a chemical 
reaction relating one or more substrates to one or more 
products, a constraint placed on a reaction, or a 

25 stoichiometric coefficient. 

As used herein, the term "constraint" is 
intended to mean an upper or lower boundary for a 
reaction. A boundary can specify a minimum or maximum 
flow of mass, electrons or energy through a reaction. 
30 A boundary can further specify directionality of a 

reaction. A boundary can be a constant value such as 
zero, infinity, or a numerical value such as an 
integer. Alternatively, a boundary can be a variable 
boundary value as set forth below. 
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As used herein, the term "variable," when 
used in reference to a constraint is intended to mean 
capable of assuming any of a set of values in response 
to being acted upon by a constraint function. The term 
"function, " when used -in the context of a constraint, 
is intended to be consistent with the meaning of the 
term as it is understood in the computer and 
mathematical arts. A function can be binary such that 
changes correspond to a reaction being off or on. 
Alternatively, continuous functions can be used such 
that changes in boundary values correspond to increases 
or decreases in activity. Such increases or decreases 
can also be binned or effectively digitized by a 
function capable of converting sets of values to 
discreet integer values. A function included in the 
term can correlate a boundary value with the presence, 
absence or amount of a biochemical reaction network 
participant such as a reactant, reaction, enzyme or 
gene. A function included in the term can correlate a 
boundary value with an outcome of at least one reaction 
in a reaction network that includes the reaction that 
is constrained by the boundary limit. A function 
included in the term can also correlate a boundary 
value with an environmental condition such as time, pH, 
temperature or redox potential. 

' As used herein, the term "activity," when 
used in reference to a reaction, is intended to mean 
the amount of product produced by the reaction, the 
amount of substrate consumed, by the reaction or the 
rate at which a product is produced or a substrate is 
consumed. The amount of product produced by the 
reaction, the amount of substrate consumed by the 
reaction or the rate at which a product is produced or 
a substrate is consumed can also be referred to as the 
flux for the reaction. 
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As used herein, the term "activity, " when 
used in reference to a Homo sapiens cell, is intended 
to mean the magnitude or rate of a change from an 
initial state to a final state. The term can include, 
5 for example, the amount of a chemical consumed or 
produced by a cell, the rate at which a chemical is 
consumed or produced by a cell,- the amount or rate of 
growth of a cell or the amount of or rate at which 
energy, mass or electrons flow through a particular 
10 subset of reactions. 



The invention provides a computer readable 
medium, having a data structure relating a plurality of 
Homo sapiens reactants to a plurality of 'Homo sapiens 
reactions, wherein each of the Homo sapiens reactions 
15 includes a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating the 
substrate and the product. 



Depending on the application, the plurality 
20 of Homo sapiens reactions can include reactions 

selected from core metabolic reactions or peripheral 
metabolic reactions. As used herein, the term "core," 
when used in reference to a metabolic pathway, is 
intended to mean a metabolic pathway selected from 
25 glycolysis/gluconeogenesis, the pentose phosphate 
pathway (PPP) , the tricarboxylic acid (TCA) cycle, 
glycogen storage, electron transfer system (ETS) , the 
malate/aspartate shuttle, the glycerol phosphate 
shuttle, and plasma and mitochondrial membrane 
30 transporters. As used herein, the term "peripheral, " 
when used in reference to a metabolic pathway, is 
intended to mean a metabolic pathway that includes one 
or more reactions that are not a part of a core 
metabolic pathway. 
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A plurality of Homo sapiens reactants can be 
related to a plurality of Homo sapiens reactions in any 
data structure that represents, for each reactant, the 
reactions by which it is consumed or produced. Thus, 
5 the data structure, which is referred to herein as a 
^reaction network data structure, " serves as a 
representation of a biological reaction network or 
system. An example of a reaction network that can be 
represented in a reaction network data structure of the 
10 invention is the collection of reactions that 
constitute the core metabolic reactions of Homo 
sapiens f or the metabolic reactions of a skeletal 
muscle cell, as shown in the Examples. 

The choice of reactions to include in a 
15 particular reaction network .data structure, from among 
all the possible reactions that can occur in human 
cells, depends on the cell type or types and the 
physiological, pathological or therapeutic condition 
being modeled, and can be determined experimentally or 
20 from the literature, as described further below. 



The reactions to be included in a particular 
network data structure of Homo sapiens can be 
determined experimentally using, for example, gene or 
protein expression profiles, where the molecular 

25 characteristics of the cell can be correlated to the 
expression levels. The expression or lack of . 
expression of genes or. proteins in a cell type can be 
used in determining whether a reaction is included in 
the model by association to the expressed gene(s) and 

30 or protein (s) . Thus, it is possible to use 

experimental technologies to determine which genes 
and/or proteins are expressed in a specific cell type, 
and to further use this information to. determine which 
reactions are present in the cell type of interest. In 
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this way a subset of reactions from all of those 
reactions that can occur in human cells are selected to 
comprise the set of reactions that represent a specific 
cell type. cDNA expression profiles have been 
5 demonstrated to be useful, for example, for 

classification of breast cancer cells (Sorlie et al., 
Proc. Natl. Acad. Sci. U.S.A. 98 (19) : 10869-10874 
(2001)). 

The methods and models of the invention can 
10 be applied to any Homo sapiens cell type at any stage 
of differentiation, including, for example, embryonic 
stem cells, hematopoietic stem cells, differentiated 
hematopoietic cells, skeletal muscle cells, cardiac 
muscle cells, smooth muscle cells, skin cells, nerve 
15 cells, kidney cells, pulmonary cells, liver cells, 

adipocytes and endocrine cells (e.g. beta islet cells 
of the pancreas, mammary gland cells, adrenal cells, 
and other specialized hormone secreting cells) . 

The methods and models of the invention can 
be applied to normal cells or pathological cells. 
Normal cells that exhibit a variety of physiological 
activities of interest, including homeostasis, 
proliferation, differentiation, apoptosis, contraction 
and motility, can be modeled. Pathological cells can 
also be modeled, including cells that reflect genetic 
or developmental abnormalities, nutritional 
deficiencies, environmental assaults, infection (such 
as by bacteria, viral, protozoan or fungal agents) , 
neoplasia, aging, altered immune or endocrine function, 
tissue damage, or any combination of these factors. 
The pathological cells can be representative of any 
type of human pathology, including, for example, 
various metabolic disorders of carbohydrate, lipid or 
protein metabolism, obesity, diabetes, cardiovascular 
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disease, fibrosis, various cancers, kidney failure, 
immune pathologies, neurodegenerative diseases, and 
various monogenetic metabolic diseases described in the 
Online Mendelian Inheritance in Man database (Center 
5 for Medical Genetics, Johns Hopkins University 

(Baltimore, MD) and National Center for Biotechnology 
Information, National Library of Medicine (Bethesda, 
MD) ) . 

The methods and models of the invention can 

10 also be applied to cells undergoing therapeutic 

perturbations , such as cells treated with drugs that 
target participants in a reaction network, cells 
treated with gene-based therapeutics that increase or 
decrease expression of an encoded protein, and cells 

15 treated with radiation. As used herein, the term 
"drug" refers to a compound of any molecular nature 
with a known or proposed therapeutic function, 
including, for example, small molecule compounds, 
peptides and other macromolecules, peptidomimetics and 

20 antibodies, any of which can optionally be tagged with 
cytostatic, targeting or detectable moieties. The term 
"gene-based therapeutic" refers- .to .-nucleic acid 
therapeutics, including, for example, expressible genes 
with normal or altered protein activity, antisense 

25 compounds, ribozymes, DNAzymes, RNA interference 

compounds (RNAi) and the like. The therapeutics can 
target any reaction network participant, in any 
cellular location, including participants in 
extracellular, cell surface, cytoplasmic, mitochondrial 

30 and nuclear locations. Experimental data that are 
gathered on the response of cells to therapeutic 
treatment, such as alterations in gene or protein 
expression profiles, can be used to tailor a network 
for a pathological state of a particular cell type. 
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The methods and models of the invention can 
be applied to Homo sapiens cells as they exist in any 
form, such as in primary cell isolates or in 
established cell lines, or in the whole body, in intact 
5 organs or in tissue explants. Accordingly, the methods 
and models can take into account intercellular 
communications and/or inter-organ communications, the 
effect of adhesion to a substrate or neighboring cells 
(such as a stem cell interacting with mesenchymal cells 
10 or a cancer cell interacting with its tissue 

microenvironment, or beta-islet cells without normal 
stroma) , and other interactions relevant to 
multicellular systems. 

The reactants to be used in a reaction 

15 network data structure of the invention can be obtained 
from or stored in a compound database. As used herein, 
the term ^compound database" is intended to mean a 
computer readable medium or media containing a 
plurality of molecules that includes substrates and 

20 products of biological reactions. The plurality of 
molecules can include molecules found in multiple 
organisms, -thereby constituting a universal compound 
database. Alternatively, the plurality of molecules 
can be limited to those that occur in a particular 

25 organism,, thereby constituting an organism-specific 
compound database. Each reactant in a compound 
database can be identified according to the chemical 
species and the cellular compartment in which it is 
present. Thus, for example, a distinction can be made 

30 between glucose in the extracellular compartment versus 
glucose in the cytosol. Additionally each of the 
reactants can be specified as a metabolite of a primary 
or secondary metabolic pathway. Although 
identification of a reactant as a metabolite of a 

35 primary or secondary metabolic pathway does not 
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indicate any chemical distinction between the reactants 
in a reaction, such a designation can assist in visual 
representations of large networks of reactions. 

As used herein, the term * compartment" is 
5 intended to mean a subdivided region containing at 

least one reactant, such that the reactant is separated 
from at least one other reactant in a second region. A 
subdivided region included in the term can be 
correlated with a subdivided region of a cell. Thus, a 
10 subdivided region included in the term can be, for 
example, the intracellular space of a cell; the ' 
extracellular space around a cell; the periplasmic 
space, the interior space of an organelle such as a 
mitochondrium, endoplasmic reticulum, Golgi apparatus, 
15 vacuole or nucleus; or any subcellular space that is 
separated from another by a membrane or other physical 
barrier. Subdivided regions can also be made in order 
to create virtual boundaries in a reaction network that 
are not correlated with physical barriers. Virtual 
20 boundaries can be made for the purpose of segmenting 
the reactions in a network into different compartments 
or substructures. 

As used herein, the term ^substructure" is 
intended to mean a portion of the information in a data 
structure that is separated from other information in 
the data structure such that the portion of information 
can be separately manipulated or analyzed. The term 
can include portions subdivided according to a 
biological function including, for example, information 
relevant to a particular metabolic pathway such as an 
internal flux pathway, exchange flux pathway, central 
metabolic pathway, peripheral metabolic pathway, or 
secondary metabolic pathway. The term can include 
portions subdivided according to computational or 
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mathematical principles that allow for a particular 
type of analysis or manipulation of the data structure. 

The reactions included in a reaction network 
data structure can be obtained from a metabolic 
5 reaction database that includes the substrates, 

products, and stoichiometry of a plurality of metabolic 
reactions of Homo sapiens. The reactants in a reaction 
network data structure can be designated as either 
substrates or products of a particular reaction, each 

10 with a stoichiometric coefficient assigned to it to 
describe the chemical conversion taking place in the 
reaction. Each reaction is also described as occurring 
in either a reversible or irreversible direction. 
Reversible reactions can either be represented as one 

15 reaction that operates in both the forward and reverse 
direction or be decomposed into two irreversible 
reactions, one corresponding to the forward reaction 
and the other corresponding to the backward reaction. 

Reactions included in a reaction network data 
20 structure can include intra-system or exchange 

reactions. Intra-system reactions are the chemically 
and electrically balanced inter conversions of chemical 
species and transport processes, which serve to 
replenish or drain the relative amounts of certain 
25 metabolites. These intra-system reactions can be 
classified as either being transformations or 
translocations. A transformation is a reaction that 
contains distinct sets of compounds as substrates and 
products, while a translocation contains reactants 
30 located in different compartments. Thus a reaction 
that simply transports a metabolite from the 
extracellular environment to the cytosol, without 
changing its chemical composition is solely classified 
as a translocation, while a reaction that takes an 
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extracellular substrate and converts it into a 
cytosolic product is both a translocation and a 
transformation. 

Exchange reactions are those which constitute 
5 sources and sinks, allowing the passage of metabolites 
into and out of a compartment or across a hypothetical 
system boundary. These reactions are included in a 
model for simulation purposes and represent the 
metabolic demands placed on Hojdo sapiens. While they 
10 may be chemically balanced in certain cases f they are 
typically not balanced and can often have only a single 
substrate or product. As a matter of convention the 
exchange reactions are further classified into demand 
exchange and input/output exchange reactions. 

15 The metabolic demands placed on the Homo 

sapiens metabolic reaction network can be readily 
determined from the dry weight composition of the cell 
which is available in the published literature or which 
can be determined experimentally. The uptake rates and 

20 maintenance requirements for Homo sapiens cells can 
also be obtained from the. published literature or 
determined experimentally. 

Input /output exchange reactions are used to 
allow extracellular reactants to enter or exit the 

25 reaction network represented by a model of the 

invention. For each of the extracellular metabolites a 
corresponding input /output exchange reaction can be 
created. These reactions are always reversible with 
the metabolite indicated as a substrate with a 

30 stoichiometric coefficient of one and no products 

produced by the reaction. This particular convention 
is adopted to allow the reaction to take on a positive 
flux value (activity level) when the metabolite is 
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being produced or rembved from the reaction network and 
a negative flux value when the metabolite is being 
consumed or introduced into the reaction network. 
These reactions will be further constrained during the 
5 course of a simulation to specify exactly which 

metabolites are available to the cell and which can be 
excreted by the cell. 

A demand exchange reaction is always 
specified as an irreversible reaction containing at 
least one substrate. These reactions are typically 
formulated to represent the production of an 
intracellular metabolite by the metabolic network or 
the aggregate production of many reactants in balanced 
ratios such as in the representation of a reaction that 
leads to biomass formation, also referred to as growth. 

A demand exchange reactions can be introduced 
for any metabolite in a model of the invention. Most 
commonly these reactions are introduced for metabolites 
that are required to be produced by the cell for the 
20 purposes of creating a new cell such as amino acids, 
nucleotides, phospholipids, and other biomass 
constituents, or metabolites that are to be produced 
for alternative purposes. Once these metabolites are 
identified, a demand exchange reaction that is 
25 irreversible and specifies the metabolite as a 

substrate with a stoichiometric coefficient of unity 
can be created. With these specifications, if the 
reaction is active it leads to the net production of 
the metabolite by the system meeting potential 
30 production demands. Examples of processes that can be 
represented as a demand exchange reaction in a reaction 
network data structure and analyzed by the methods of 
the invention include, for example, production or 
secretion of an individual protein; production or 
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secretion of an individual metabolite such as an amino 
acid, vitamin, nucleoside, antibiotic or surfactants- 
production of ATP for extraneous energy requiring 
processes such as locomotion; or formation of biomass 
constituents. 

In addition to these demand exchange 
reactions that are placed on individual metabolites, 
demand exchange reactions that utilize multiple 
metabolites in defined stoichiometric ratios can be 
introduced. These reactions are referred to as 
aggregate demand exchange reactions. An example of an 
aggregate demand reaction is a reaction used to 
simulate the concurrent growth demands or production 
requirements associated with cell growth that are 
placed on a cell, for example, by simulating the 
formation of multiple biomass constituents 
simultaneously at a particular cellular growth rate. 

A hypothetical reaction network is provided 
in Figure 1 to exemplify the above-described reactions 
and their interactions. The reactions can be 
represented in the exemplary data structure shown in 
Figure 3 as set forth below. The reaction network, 
shown in Figure 1, includes intrasystem reactions that 
occur entirely within the compartment indicated by the 
shaded oval such as reversible reaction R 2 which acts 
on reactants B and G and reaction R 3 which converts one 
equivalent of B to 2 equivalents of F. The reaction 
network shown in Figure 1 also contains exchange 
reactions such as input/output exchange reactions A^ 
and E xt , and the demand exchange reaction, V growth , which 
represents growth in response to the one equivalent of 
D and one equivalent of F. Other intrasystem reactions 
include R x which is a translocation and transformation 
reaction that translocates reactant A into the 
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compartment and transforms it to reactant G and 
reaction R 6 which is a transport reaction that 
translocates reactant E out of the compartment . 

A reaction network can be represented as a 
5 set of linear algebraic equations which can be 

presented as a stoichiometric matrix S, with S being an 
m x n matrix where m corresponds to the number of 
reactants or metabolites and n corresponds to the 
number of reactions taking place in the network. An 
10 example of a stoichiometric matrix representing the 

reaction network of Figure 1 is shown in Figure 3. As 
shown in Figure 3, each column in the matrix 
corresponds to a particular reaction n, each row 
corresponds to a particular reactant m, and each 
15 element corresponds to the stoichiometric coefficient 
of the reactant m in the reaction denoted n. The 
stoichiometric matrix includes intra-system reactions 
such as R 2 and R 3 which are related to reactants that 
participate in the respective reactions according to a 
20 stoichiometric coefficient having a sign indicative of 
whether the reactant is a substrate or product of the 
reaction and a value correlated with the number of 
equivalents of the reactant consumed or produced by the 
reaction. Exchange reactions such as -E^ and ~A xt are 
25 similarly correlated with a stoichiometric coefficient. 
As exemplified by reactant E, the same compound can be 
treated separately as an internal reactant (E) and an 
external reactant (E external ) such that an exchange 
reaction (Rg) exporting the compound is correlated by 
30 stoichiometric coefficients of -1 and 1, respectively. 
However, because the compound is treated as a separate 
reactant by virtue of its compartmental location, a 
reaction, such as R 5 , which produces the internal 
reactant (E) but does not act on the external reactant 
35 (External) is correlated by stoichiometric coefficients 
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of 1 and 0, respectively. Demand reactions such as 
Vgrowth car * also be included in the stoichiometric matrix 
being correlated with substrates by an appropriate 
stoichiometric coefficient. 

5 

As set forth in further detail below, a 
stoichiometric matrix provides a convenient format for 
representing and analyzing a reaction network because 
it can be readily manipulated and used to compute 

10 network properties, for example, by using linear 

programming or general convex analysis. A reaction 
network data structure can take on a variety of formats 
so long as it is capable of relating reactants and 
reactions in the manner exemplified above for a 

15 stoichiometric matrix and in a manner that can be 

manipulated to determine an activity of one or more 

•i 

reactions using methods such as those exemplified 
below. Other examples of reaction network data 
structures that are useful in. the invention include a 
20 connected graph, list of chemical reactions or a table 
of reaction equations. 

A reaction network data structure can be 
constructed to include all reactions that are involved 

25 in Homo sapiens metabolism or any portion thereof. A 
portion of Homo sapiens metabolic reactions that can be 
included in a reaction network data structure of the 
invention includes, for example, a central metabolic 
pathway such as glycolysis, the TCA cycle, the PPP or 

30 ETS; or a peripheral metabolic pathway such as amino 
acid biosynthesis, amino acid degradation, purine 
biosynthesis, pyrimidine biosynthesis, lipid 
biosynthesis, fatty acid metabolism, vitamin or 
cofactor biosynthesis, transport processes and 

35 alternative carbon source catabolism. Examples of 
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individual pathways within the peripheral pathways are 
set forth in Table 1. 

Depending upon a particular application, a 
reaction network data structure can include a plurality 
of Homo sapiens reactions including any or all of the 
reactions listed in Table 1. 

For some applications , it can be advantageous 
to use a reaction network data structure that includes 
a minimal number of reactions to achieve a particular 
Homo sapiens activity under a particular set of 
environmental conditions. A reaction network data 
structure having a minimal number of reactions can be 
identified by performing the simulation methods 
described below in an iterative fashion where different 
reactions or sets of reactions are systematically 
removed a*nd the effects observed. Accordingly , the 
invention provides a computer readable medium, 
containing a data structure relating a plurality of 
Homo sapiens reactants to a plurality of Homo sapiens 
reactions, wherein the plurality of Homo sapiens 
reactions contains at least 65 reactions. For example, 
the core metabolic reaction database shown in Tables 2 
and 3 contains 65 reactions, and is sufficient to 
simulate aerobic and anaerobic metabolism on a number 
of carbon sources, including glucose. 

Depending upon the particular cell type or 
types, the physiological, pathological or therapeutic 
conditions being tested and the desired activity, a 
reaction network data structure can contain smaller 
numbers of reactions such as at least 200, 150, 100 or 
50 reactions. A reaction network data structure having 
relatively few reactions can provide the advantage of 
reducing computation time and resources required to 
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perform a simulation. When desired, a reaction network 
data structure having a particular subset of reactions 
can be made or used in which reactions that are not 
relevant to the particular simulation are omitted. 
5 Alternatively, larger numbers of reactions can be 

included in order to increase the accuracy or molecular 
detail of the methods of the invention or to suit a 
particular application. Thus, a reaction network data 
structure can contain at least 300, 350, 400, 450, 500, 

10 550, 600 or more reactions up to the number of 

reactions that occur in or by Homo sapiens or that are 
desired to simulate the activity of the full set of 
reactions occurring in Homo sapiens. A reaction 
network data structure that is substantially complete 

15 with respect to the metabolic reactions of Homo sapiens 
provides the advantage of being relevant to a wide 
range of conditions to be simulated, whereas those with 
smaller numbers of metabolic reactions are limited to a 
particular subset of conditions to be simulated. 

20 A Homo sapiens reaction network data 

structure can include one or more reactions that occur 
in or by Homo sapiens and that do not occur, either 
naturally or following manipulation, in or by another 
organism, such as Saccharomyces cerevisiae. It is 

25 understood that a Homo sapiens reaction network data 
structure of a particular cell type can also include 
one or more reactions that occur in another cell type. 
Addition of such heterologous reactions to a reaction 
network data structure of the invention can be used in 

30 methods to predict the consequences of heterologous 

gene transfer and protein expression, for example, when 
designing in vivo and ex vivo gene therapy approaches. 
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The reactions included in a reaction network 
data structure of the invention can be metabolic 
reactions. A reaction network data structure can also 
be constructed to include other types of reactions such 
5 as regulatory reactions , signal transduction reactions, 
cell cycle reactions, reactions controlling 
developmental processes, reactions involved in 
apoptosis, reactions involved in responses to hypoxia, 
reactions involved in responses to. cell-cell or cell- 
10 substrate interactions, reactions involved in protein 
synthesis and regulation thereof, reactions involved in 
gene transcription and translation, and regulation 
thereof, and reactions involved in assembly of a cell 
and its subcellular components. 

15 A reaction network data structure or index of 

reactions used in the data structure such as that 
available in a metabolic reaction database, as 
described above, can be annotated to include 
information about a particular reaction. A reaction 

20 can be annotated to indicate, for example, assignment 
of the reaction to a .protein, macromolecule or enzyme 
that performs the reaction, assignment of a gene(s) 
that codes for the protein, macromolecule or enzyme, 
the Enzyme Commission (EC) number of the particular 

25 metabolic reaction, a subset of reactions to which the 
reaction belongs, citations to references from which 
information was obtained, or a level of confidence with 
which a reaction is believed to occur in Homo sapiens. 
A computer readable medium or media of the invention 

30 can include a gene database containing annotated 

reactions. Such information can be obtained during the 
course of building a metabolic reaction database or 
model of the invention as described below. 
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As used herein, the term "gene database" is 
intended to mean a computer readable medium or media 
that contains at least one reaction that is annotated 
to assign a reaction to one or more macromolecules that 
5 perform the reaction or to assign one or more nucleic 
acid that encodes the one or more macromolecules that 
perform the reaction. A gene database can contain a 
plurality of reactions, some or all of which are 
annotated. An annotation can include, for example, a 

10 name for a macromolecule; assignment of a function to a 
macromolecule; assignment of an organism that contains 
the macromolecule or produces the macromolecule; 
assignment of a subcellular location for the 
macromolecule; assignment of conditions under which a 

15 macromolecule is regulated with respect to performing a 
reaction, being expressed or being degraded; assignment 
of a cellular component that regulates a macromolecule; 
an amino acid or nucleotide sequence for the 
macromolecule; or any other annotation found for a 

20 macromolecule in a genome database such as those that 
can be found in Genbank, a site maintained by the NCBI 
(ncbi.nlm.gov), the Kyoto Encyclopedia of Genes and 
Genomes (KEGG) (www.genome.ad.jp/kegg/), the protein 
database SWISS-PR0T (ca.expasy.org/sprot/), the 

25 LocusLink database maintained by the NCBI 

(www.ncbi.nlm.nih.gov/LocusLink/), the Enzyme 
Nomenclature database maintained by G.P. Moss of Queen ; 
Mary and Westfield College in the United Kingdom 
(www.chem.qmw.ac.uk/iubmb/enzyme/) . 

30 A gene database of the invention can include 

a substantially complete collection of genes or open 
reading frames in Homo sapiens or a substantially 
complete collection of the macromolecules encoded by 
the Homo sapiens genome. Alternatively, a gene 

35 database can include a portion of genes or open reading 
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frames in Homo sapiens or a portion of the 
macromolecules encoded by the Homo sapiens genome, such 
as the portion that includes substantially all 
metabolic genes or macromolecules. The portion can be 
5 at least 10% f 15%, 20%, 25%, 50%, 75%, 90% or 95% of 
the genes or open reading frames encoded by the Homo 
sapiens genome, or the macromolecules encoded therein. 
A gene database can also include macromolecules encoded 
by at least- a portion of the nucleotide sequence for 

10 the Homo sapiens genome such as at least 10%, 15%, 20%, 
25%, 50%, 75%, 90% or 95% of the Homo sapiens genome. 
Accordingly, a computer readable medium or media of the 
invention can include at least one reaction for each 
macromolecule encoded by a portion of the Homo sapiens 

15 genome. 

An in silico Homo sapiens model of the 
invention can be built by an iterative process which 
includes gathering information regarding particular 
reactions to be added to a model, representing the 
20 reactions in a reaction network data structure, and 
performing preliminary simulations wherein a set of 
constraints is placed on the reaction network and the 
output evaluated to identify errors in the network. 
Errors in the network such as gaps that lead to non- 
25 natural accumulation or consumption of a particular 
metabolite can be identified as described below and 
simulations repeated until a desired performance of the 
model is attained. An exemplary method for iterative 
model construction is provided in Example I. 

30 Thus, the invention provides a method for 

making a data structure relating a plurality of Homo 
sapiens reactants to a plurality of Homo sapiens 
reactions in a computer readable medium or media. The 
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method includes the steps of: (a) identifying a 
plurality of Homo sapiens reactions and a plurality of 
Homo sapiens reactants that are substrates and products 
of the Homo sapiens reactions; (b) relating the 
5 plurality of Homo sapiens reactants- to the plurality of 
Homo sapiens reactions in a data structure, wherein 
each of the Homo sapiens reactions includes a reactant 
identified as a substrate of the reaction, a reactant 
identified as a product of the reaction and a 

10 stoichiometric coefficient relating the substrate and 
the product; (c) making a constraint set for the 
plurality of Homo sapiens reactions; (d) providing an 
objective function; (e) determining at least one flux 
distribution that minimizes or maximizes the objective 

15 function when the constraint set is applied to the data 
structure, and (f) if the at least one flux 
distribution is not predictive of Homo sapiens 
physiology, then adding a reaction to or deleting a 
reaction from the data structure and repeating step 

20 (e), if the at least one flux distribution is 

predictive of Homo sapiens physiology, then storing the 
data structure in a computer readable medium or media. 

Information to be included in a data 
structure of the invention can be gathered from a 
25 variety of sources including, for example, annotated 

genome sequence information and biochemical literature. 



Sources of annotated human genome sequence 
information include, for example, KEGG, SWISS-PROT, 
LocusLink, the Enzyme Nomenclature database, the 
30 International Human Genome Sequencing Consortium and 
commercial databases. KEGG contains a broad range of 
information, including a substantial amount of 
metabolic reconstruction. The genomes of 63 organisms 
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can be accessed here, with gene products grouped by 
coordinated functions, often represente<d by a map 
(e.g., the enzymes involved in glycolysis would be 
grouped together) . The maps are biochemical pathway 
5 templates which show enzymes connecting metabolites for 
various parts of metabolism. These general pathway 
templates are customized for a given organism by 
highlighting enzymes on a given template which have 
been identified in the genome of the organism. Enzymes 

10 and metabolites are active and yield useful information 
about stoichiometry, structure, alternative names and 
the like, when accessed, 
•i • 

SWISS-PROT contains detailed information 
about protein function. Accessible information 

15 includes alternate gene and gene product names, 

function, structure and sequence information, relevant 
literature references, and the like. 

LocusLink contains general information about 
the locus where the gene is located and, of relevance, 
20 tissue specificity, cellular location, and implication 
of the gene product in various disease states. 

The Enzyme Nomenclature database can be used 
to compare the gene products of two organisms. Often 
the gene names for genes with similar functions in two 

25 or more organisms are unrelated. When this is the 

case, the E.C. (Enzyme Commission) numbers can be used 
as unambiguous indicators of gene product function. 
The information in the Enzyme Nomenclature database is 
also published in Enzyme Nomenclature (Academic Press, 

30 San Diego, California, 1992) with 5 supplements to 

date, all found in the European Journal of Biochemistry 
(Blackwell Science, Maiden, MA) . 
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Sources of biochemical information include, 
for example, general resources relating to metabolism r 
resources relating specifically to human metabolism, 
and resources relating to the biochemistry, physiology 
5 and pathology of specific human cell types. 

Sources of general information relating to 
metabolism, which were used to generate the human 
reaction databases and models described herein, were 
J.G. Salway, Metabolism at a Glance , 2 nd ed., Blackwell 

10 Science, Maiden, MA (1999) and T.M. Devlin, ed. , 

Textbook of Biochemistry with Clinical Correlations , 
4 th ed., John Wiley and Sons, New York, NY (1997). 
Human metabolism-specific resources included J.R. 
Bronk, Human Metabolism: Functional Div ersity and 

15 Integration , Addison Wesley Longman, Essex, England 
(1999) . 

The literature used in conjunction with the 
skeletal muscle metabolic models and simulations 
described herein included R. Maughan et al., 

20 Biochemistry of Exercise and Training , Oxford 

University Press, Oxford, England (1997), as well as 
references on muscle pathology such as S. Carpenter et 
al., Pathology of Skeletal Muscle , 2 Dd ed. , Oxford 
University Press, Oxford, England (2001), and more 

25 specific articles on muscle metabolism as may be found 
in the Journal of Physiology (Cambridge University 
Press, Cambridge, England) . 

In the course of developing an in silico 
30 model of Homo sapiens metabolism, the types of data 
that can be considered include, for example, 
biochemical information which is information related to 
the experimental characterization of a chemical 
reaction, often directly indicating a protein (s) 
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associated with a reaction and the stoichiometry of the 
reaction or indirectly demonstrating the existence of a 
reaction occurring within a cellular extract; genetic 
information, which is information related to the 
5 experimental identification and genetic 

characterization of a gene(s) shown to code for a 
particular protein (s) implicated in carrying out a 
biochemical event; genomic information, which is 
information related to the identification of an open 

10 reading frame and functional assignment, through 

computational sequence analysis, that is then linked to 
a protein performing a biochemical event; physiological 
information, which is information related to overall 
cellular physiology, fitness characteristics, substrate 

15 utilization, and phenotyping results, which provide 
evidence of the assimilation or dissimilation of a 
compound used to infer the presence of specific 
biochemical event (in particular translocations); and 
modeling information, which is information generated 

20 through the course of simulating activity of Homo 
sapiens cells using methods such as those described 
herein which lead to predictions regarding the status 
of a reaction such as whether or not the reaction is 
required to fulfill certain demands placed on a-. 

25 metabolic network. Additional information relevant to 
multicellular organisms that can be considered includes 
cell type-specific or condition-specific gene 
expression information, which can be determined 
experimentally, such as by gene array analysis or from 

30 expressed sequence tag (EST) analysis, or obtained from 
the biochemical and physiological literature. 

The majority of the reactions occurring in 
Homo sapiens reaction networks are catalyzed by 
enzymes/proteins, which are created through the 
35 transcription and translation of the genes found within 
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the chromosome in the cell. The remaining reactions 
occur either spontaneously or through non-enzymatic 
processes. Furthermore, a reaction network data 
structure can contain reactions that add or delete 
5 steps to or from a particular reaction pathway. For 
example, reactions can be added to optimize or improve 
performance of a Homo sapiens model in view of 
empirically observed activity. Alternatively, 
reactions can be deleted to remove intermediate steps 

10 in a pathway when the intermediate steps are not 
necessary to model flux through the pathway. For 
example, if a pathway contains 3 nonbranched steps, the 
reactions can be combined or added together to give a 
net reaction, thereby reducing memory required to store 

15 the reaction network data structure and the 

computational resources required for manipulation of 
the data structure . 

The reactions that occur due to the activity 
of gene-encoded enzymes can be obtained from a genome 

20 database which lists genes identified from genome 

sequencing and subsequent genome annotation. Genome 
annotation consists of the locations of open reading 
frames and assignment of function from homology to 
other known genes or empirically determined activity. 

25 Such a genome database can be acquired through public 
or private databases containing annotated Homo sapiens 
nucleic acid or protein sequences. If desired, a model 
developer can perform a network reconstruction and 
establish the model content associations between the 

30 genes, proteins, and reactions as described, for 
example, in Covert et al. Trends in Biochemical 
gciences 26:179-186 (2001) and Palsson, WO 00/46405. 



As reactions are added to a reaction network 
35 data structure or metabolic reaction database, those 
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having known or putative associations to the 
proteins /enzymes which enable/catalyze the reaction and 
the associated genes that code for these proteins can 
be identified by annotation. Accordingly f the 
5 appropriate associations for all of the reactions to 
their related proteins or genes or both can be 
assigned. These associations can be used to capture 
the non-linear relationship between the genes and 
proteins as well as between proteins and reactions. In 

10 some cases one gene codes for one protein which then 
perform one reaction. However, often there are 
multiple genes which are required to create an active 
enzyme complex and often there are multiple reactions 
that can be carried out by one protein or multiple 

15 proteins that can carry out the same reaction. These 
associations capture the logic (i.e. AND or OR 
relationships) within the associations. Annotating a 
metabolic reaction database with these associations can 
allow the methods to be used to determine the effects 

20 of adding or eliminating a particular reaction not only 
at the reaction level, but at the genetic or protein 
level in the context of running a simulation or 
predicting Homo sapiens activity. 

A reaction network data structure of the 
25 invention can be used to determine the activity of one 
or more reactions in a plurality of Homo sapiens 
reactions independent of any knowledge or annotation of 
the identity of the protein that performs the reaction 
or the gene encoding the protein. A model that is 
30 annotated with gene or protein identities can include 
reactions for which a protein or encoding gene is not 
assigned. While a large portion of the reactions in a 
cellular metabolic network are associated with genes in 
the organism's genome, there are also a substantial 
35 number of reactions included in a model for which there 



WO 03/082214 PCT/DS03/09751 

36 

are no known genetic associations. Such reactions can 
be added to a reaction database based upon other 
information that is not necessarily related to genetics 
such as biochemical or cell based measurements or 
theoretical considerations based on observed 
biochemical or cellular activity. For example, there 
are many reactions that can either occur spontaneously 
or are not protein-enabled reactions. Furthermore, the 
occurrence of a particular reaction in a cell for which 
no associated proteins or genetics have been currently 
identified can be indicated during the course of model 
building by the iterative model building methods of the 
invention. 

The reactions in a reaction network data 
structure or reaction database can be assigned to 
subsystems by annotation, if desired. The reactions 
can be subdivided according to biological criteria, 
such as according to traditionally identified metabolic 
pathways (glycolysis, amino acid metabolism and the 
like) or according to mathematical or computational 
criteria that facilitate manipulation of a model that 
incorporates or manipulates the reactions. Methods and 
criteria for subdviding a reaction database are • 
described in further detail in Schilling et al., J. 
Theor. Biol. 203:249-283 (2000-) , and in Schuster et 
al., Bioinformatics 18:351-361 (2002). The use of 
subsystems can be advantageous for a number of analysis 
methods, such as extreme pathway analysis, and can make 
the management of model content easier. Although 
assigning reactions to subsystems can be achieved 
without affecting the use of the entire model for 
simulation, assigning reactions to subsystems can allow 
a user to search for reactions in a particular 
subsystem which may be useful in performing various 
types of analyses. Therefore, a reaction network data 
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structure can include any number of desired subsystems 
including, for example, 2 or more subsystems, 5 or more 
subsystems, 10 or more subsystems, 25 or more 
subsystems or 50 or more subsystems. 

5 The reactions in a 'reaction network data 

structure or metabolic reaction database can be 
annotated with a value indicating the confidence with 
which the reaction is believed to occur in the Homo 
sapiens cell. The level of confidence can be, for 
10 example, a function of the amount and form of 

supporting data that is available. This data can come 
in various forms including published literature, 
documented experimental results, or results of 
computational analyses. Furthermore, the data can 

15 provide direct or indirect evidence for the existence 
of a chemical reaction in a cell based on genetic, 
biochemical, and/or physiological data. 

The invention further provides a computer 
readable medium, containing (a) a data structure 

20 relating a plurality of Homo sapiens reactants to a 
plurality of Homo sapiens reactions, wherein each of 
the Homo sapiens reactions includes a reactant 
identified as a substrate of the reaction, a reactant 
identified as a product of the reaction and a 

25 stoichiometric coefficient relating the substrate and 
the product, and (b) a constraint set for the plurality 
of Homo sapiens reactions. 

Constraints can be placed on the value of any 
of the fluxes in the metabolic network using a 
30 constraint set. These constraints can be 

representative of a minimum or maximum allowable flux 
through a given reaction, possibly resulting from a 
limited amount of an enzyme present. Additionally, the 
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constraints can determine the direction or 
reversibility of any of the reactions or transport 
fluxes in the reaction network data structure. Based 
on the in vivo environment where Homo sapiens lives the 
5 metabolic resources available to the cell for 
biosynthesis of essential molecules for can be 
determined. Allowing the corresponding transport 
fluxes to be active provides the in silico Homo sapiens 
with inputs and outputs for substrates and by-products 
10 produced by the metabolic network. 

Returning to the hypothetical reaction 
network shown in Figure 1, constraints can be placed on 
each reaction in the exemplary format shown in Figure 
2, as follows. The constraints are provided in a 
15 format that can be used to constrain the reactions of 
the stoichiometric matrix shown in Figure 3. The 
format for the constraints used for a matrix or in 
linear programming can be conveniently represented as a 
linear inequality such as 

20 bj <; Vj * aj :j = 1 n (Eq. 1) 

where Vj is the metabolic flux vector , bj is the 
minimum flux value and a$ is the maximum flux value. 
Thus, a 3 can take on a finite value representing a 
maximum allowable flux through a given reaction or h$ 

25 can take on a finite value representing minimum 

allowable flux through a given reaction. Additionally , 
if one chooses to leave certain reversible reactions' or 
transport fluxes to operate in a forward and reverse 
manner the flux may remain unconstrained by setting bj 

30 to negative infinity and a 3 to positive infinity as 
shown for reaction R 2 in Figure 2. If reactions 
proceed only in the forward reaction b^ is set to zero 
while a 1 is set to positive infinity as shown for 
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reactions R lf R 3f R 4 , R 5/ and R 6 in Figure 2. As an 
example, to simulate the event of a genetic deletion or 
non-expression of a particular protein, the flux 
through all of the corresponding metabolic reactions 
5 related to the gene or protein in question are reduced 
to zero by setting a 3 and b 5 to be zero. Furthermore, 
if one wishes to simulate the absence of a particular 
growth substrate one can simply constrain the 
corresponding transport fluxes that allow the 
10 metabolite to enter the cell to be zero by setting a^ 
and bj to be zero. On the other hand if a substrate is 
only allowed to enter or exit the cell via transport 
mechanisms, the corresponding fluxes can be properly 
constrained to reflect this scenario. 

15 The ability of a reaction to be actively 

occurring is dependent on a large number of additional 
factors beyond just the availability of substrates. 
These factors, which can be represented as variable 
constraints in the models and methods of the invention 

20 include, for example, the presence of cof actors 

necessary to stabilize the protein/enzyme, the presence 
or absence of enzymatic inhibition and activation 
factors, the active formation of the protein/enzyme 
through translation of the corresponding mRNA 

25 transcript, the transcription of the associated gene(s) 
or the presence of chemical signals and/or proteins 
that assist in controlling these processes that 
ultimately determine whether a chemical reaction is 
capable of being carried out within an organism. Of 

30 particular importance in the regulation of human cell 
types is the implementation of paracrine and endocrine 
signaling pathways to control cellular activities. In 
these cases a cell secretes signaling molecules that 
may be carried far afield to act on distant targets 

35 (endocrine signaling), or act as local mediators 
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(paracrine signaling) . Examples of endocrine signaling 
molecules include hormones such as insulin, while 
examples of paracrine signaling molecules include 
neurotransmitters such as acetylcholine- These 
5 molecules induce cellular responses through signaling 
cascades that affect the activity of biochemical 
reactions in the cell. Regulation can be represented 
in an in silico Homo sapiens model by providing a 
variable constraint as set forth below. 

10 

Thus, the invention provides a computer 
readable medium or media, including (a) a data 
structure relating a plurality of Homo sapiens 
reactants to a plurality of Homo sapiens reactions, 

15 wherein each of the reactions includes a reactant 

identified as a substrate of the reaction, a reactant 
identified as a product of the reaction and a 
stoichiometric coefficient relating the substrate and 
the product, and wherein at least one of the reactions 

20 is a regulated reaction; and (b) a constraint set for 
the plurality of reactions, wherein the constraint set 
includes a variable constraint for the regulated 
reaction. 

As used herein, the term "regulated," when 
25 used in reference to a reaction in a data structure, is 
intended to mean a reaction that experiences an altered 
flux due to a change in the value of a constraint or a 
reaction that has a variable constraint. 

As used herein, the term "regulatory 
30 reaction" is intended to mean a chemical conversion or 
interaction that alters the activity of a protein, 
macromolecule or enzyme. A chemical conversion or 
interaction can directly alter the activity of a 
protein, macromolecule or enzyme such as occurs when 
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the protein, macromolecule or enzyme is 
post-translationally modified or can indirectly alter 
the activity of a protein, macromolecule or enzyme such 
as occurs when a chemical conversion or binding event 
5 leads to altered expression of the protein, 

macromolecule or enzyme. Thus, transcriptional or 
translational regulatory pathways can indirectly alter 
a protein, macromolecule or enzyme or an associated 
reaction. Similarly, indirect regulatory reactions can 

10 include reactions that occur due to downstream 

components or participants in a regulatory reaction 
network. When used in reference to a data structure or 
in silico Homo sapiens model, the term is intended to 
mean a first reaction that is related to a second 

15 reaction by a function that alters the flux through the 
second reaction by changing the value of a constraint 
on the second reaction. 

As used herein, the term "regulatory data 
structure" is intended to mean a representation of an 

20 event, reaction or network of reactions that activate 
or inhibit a reaction, the representation being in a 
format that can be manipulated or analyzed. An event 
that activates a reaction can be an event that 
initiates the reaction or an event that increases the 

25 rate or level of activity for the reaction. An event 
that inhibits a reaction can be an event that stops the 
reaction or an event that decreases the rate or level 
of activity for the reaction. Reactions that can be 
represented in a regulatory data structure include, for 

30 example, reactions that control expression of a 

macromolecule that in turn, performs a reaction such as 
transcription and translation reactions, reactions that 
lead to post translational modification of a protein or 
enzyme such as phophorylation, dephosphorylation, 

35 prenylation, methylation, oxidation or covalent 
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modification, reactions that process a protein or 
enzyme such as removal of a pre- or pro-sequence , 
reactions that degrade a protein or enzyme or reactions 
that lead to assembly of a protein or enzyme, 

5 As used herein, the term "regulatory event" 

is intended to mean a modifier of the flux through a 
reaction that is independent of the amount of reactants 
available to the reaction. A modification included in 
the term can be a change in the presence, absence , or 

10 amount of an enzyme that performs a reaction. A 
modifier included in the term can be a regulatory' 
reaction such as a signal transduction reaction or an 
environmental condition such as a change in pH,' 
temperature, redox potential or time. It will be 

15 understood that when used in reference to an in silico 
Homo sapiens model or data structure a regulatory event 
is intended to be a representation of a modifier of the 
flux through a Homo sapiens reaction that is 
independent of the amount of reactants available to the 

20 reaction. 



The effects of regulation on one or more 
reactions that occur in Homo sapiens can be predicted 
using an in silico Homo sapiens model of the invention. 

25 Regulation can be taken into consideration in the 
context of a particular condition being examined by 
providing a variable constraint for the reaction in an 
in silico Homo sapiens model. Such constraints 
constitute condition-dependent constraints. A data 

30 structure can represent regulatory reactions as Boolean 
logic statements (Reg-reaction) . The variable takes on 
a value of 1 when the reaction is available for use in 
the reaction network and will take on a value of 0 if 
the reaction is restrained due to some regulatory 

35 feature. A series of Boolean statements can then be 



WO 03/082214 PCTYUS03/09751 

43 

introduced to mathematically represent the regulatory 
network as described for example in Covert et al. J. 
Theor; Biol. 213:73-88 (2001). For example, in the 
case of a transport reaction (A_in) that imports 
5 metabolite A, where metabolite A inhibits reaction R2 
as shown in Figure 4, a Boolean rule can state that: 

Reg-R2 = IF NOT (A_in) . (Eq. 2) 

This statement indicates that reaction R2 can occur if 
reaction A_in is not occurring (i.e. if metabolite A is 
not present) . Similarly, it is possible to assign the 
regulation to a variable A which would indicate an 
amount of A above or below a threshold that leads to 
the inhibition of reaction R2. Any function that 
provides values for variables corresponding to each of 
the reactions in the biochemical reaction network can 
be used to represent a regulatory reaction or set of 
regulatory reactions in a regulatory data structure. 
Such functions can include, for example, fuzzy logic, 
heuristic rule-based descriptions, differential 
equations or kinetic equations detailing system 
dynamics . 

A reaction constraint placed on a reaction 
can be incorporated into-an in silico Homo sapiens 
25 model using the following general equation: 

(Reg-Reaction) *b 3 z v 3 * a 3 * (Reg-Reaction) 

: (Eq. 3) 

j = 1. — n 

For the example of reaction R2 this equation is written 
30 as follows: 



10 



15 



20 



(0)*Reg-R2 * R2 s (~)*Reg-R2. (Eq. 4) 
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Thus, during the course of a simulation, depending upon 
the presence or absence of metabolite A in the interior 
of the cell where reaction R2 occurs, the value for the 
upper boundary of flux for reaction R2 will change from 
5 0 to infinity, respectively. 

With the effects of a regulatory event or 
network taken into consideration by a constraint 
function and the condition-dependent constraints set to 
an initial relevant value, the behavior of the Homo 
10 sapiens reaction network can be simulated for the 
conditions considered as set forth below. 

Although regulation has been exemplified 
above for the case where a variable constraint is 
dependent upon the outcome of a reaction in the data 
15 structure, a plurality of variable constraints can be 
included in an in silico Homo sapiens model to 
represent regulation of a plurality of reactions. 
Furthermore, in the exemplary case set forth above, the 
regulatory structure includes a general control stating 

20 that a reaction is inhibited by a particular 

environmental condition. Using a general control of 
this type, it is possible to incorporate molecular 
mechanisms and additional detail into the regulatory 
structure that is responsible for determining the 

25 active nature of a particular chemical reaction within 
an organism. 

Regulation can also be simulated by a model 
of the invention and used to predict a Homo sapiens 
physiological function without knowledge of the precise 
30 molecular mechanisms involved in the reaction network 
being modeled. Thus, the model can be used to predict, 
in silico , overall regulatory events or causal 
relationships that are not apparent from in vivo 
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observation of any one reaction in a network or whose 
in vivo effects on a particular reaction are not known. 
Such overall regulatory effects can include those that 
result from overall environmental conditions such as 
changes in pH, temperature, redox potential, or the 
passage of time. 

The in silico Homo sapiens model and methods 
described herein can be implemented on any conventional 
host computer system, such as those based on Intel. RTM. 
microprocessors and running Microsoft Windows operating 
systems. Other systems, such as those using the UNIX or 
LINUX operating system and based on IBM. RTM. , DEC. RTM. 
or Motorola. RTM. microprocessors are also contemplated. 
The systems and methods described herein can also be 
implemented to run on client-server systems and 
wide-area networks, such as the Internet. 

Software to implement a method or model of 
the invention can be written in any well-known computer 
language, such as Java, C, C++, Visual Basic, FORTRAN 
or COBOL and compiled using any well-known compatible 
compiler. The software of the invention normally runs 
from instructions stored in a memory on a host computer 
system. A memory or computer readable medium can be a t 
hard disk, floppy disc, compact disc, magneto-optical 
disc, Random Access Memory, Read Only Memory or Flash 
Memory. The memory or computer readable medium used in 
the invention can be contained within a single computer 
or distributed in a network. A network can be any of a 
number of conventional network systems known in the art 
.such as a local area network (LAN) or a wide area 
network (WAN) . Client-server environments, database 
servers and networks that can be used in the invention 
are well known in the art. For example, the database 
server can run on an operating system such as UNIX, 
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running a relational database management system, a 
World Wide Web application and a World Wide Web server. 
Other types of memories and computer readable media are 
also contemplated to function within the scope of the 
5 invention. 

A database or data structure of the invention 
can be represented in a markup language format 
including, for example, Standard Generalized Markup 
Language (SGML), Hypertext markup language (HTML) or 

10 Extensible Markup language (XML) . Markup languages can 
be used to tag the information stored in a database or 
data structure of the invention, thereby providing 
convenient annotation and transfer of data between 
databases and data structures. In particular, an XML 

15 format can be useful for structuring the data 

representation of reactions, reactants and their 
annotations; for exchanging database contents, for 
example, over a network or internet; for updating 
individual elements using the document object model; or 

20 " for providing differential access to multiple users for 
different information content of a data base or data 
structure of the invention. XML programming methods 
and editors for writing XML code are known in the art 
as described, for example, in Ray, "Learning XML" 

25 O'Reilly and Associates, Sebastopol, CA (2001). 

A set of constraints can be applied to a 
reaction network data structure to simulate the flux of 
mass through the reaction network under a particular 
set of environmental conditions specified by a 
30 constraints set. Because the time constants 

characterizing metabolic transients and/or metabolic 
reactions are typically very rapid, on the order pf 
milli-seconds to seconds, compared to the time 
constants of cell growth on the order of hours to days, 
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the transient mass balances can be simplified to only 
consider the steady state behavior. Referring now to an 
example where the reaction network data structure is a 
stoichiometric matrix, the steady state mass balances 
5 can be applied using the following system of linear 
equations 

S • v = 0 (Eq. 5) 

where S is the stoichiometric matrix as defined above 
and v is the flux vector. This equation defines the 

10 mass, energy, and redox potential constraints placed on 
the metabolic network as a result of stoichiometry . 
Together Equations 1 and 5 representing the reaction 
constraints and mass balances, respectively, 
effectively define the capabilities and constraints of 

15 the metabolic genotype and the organism's metabolic 

potential. All vectors, v, that satisfy Equation 5 are 
said to occur in the mathematical nullspace of S. 
Thus, the null space defines steady-state metabolic 
flux distributions that do not violate the mass, 

20 energy, or redox balance constraints. Typically, the 
number of fluxes is greater than the number of mass 
balance constraints, thus a plurality of flux 
distributions satisfy the mass balance constraints and 
occupy the null space. The null space, which defines 

25 the feasible set of metabolic flux distributions, is 
further reduced in size by applying the reaction 
constraints set forth in Equation 1 leading to a 
defined solution space. A point in this space 
represents a flux distribution and hence a metabolic 

30 phenotype for the network. An optimal solution within 
the set of all solutions can be determined using 
mathematical optimization methods when provided with a 
stated objective and a constraint set. The calculation 
of any solution constitutes a simulation of the model. 
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Objectives for activity of a human cell can 
be chosen. While the overall objective of a 
multi-cellular organism may be growth or reproduction, 
individual human cell types generally have much more 
5 complex objectives, even to the seemingly extreme 

objective of apoptosis (programmed cell death) , which 
may benefit the organism but certainly not the 
individual cell. For example, certain cell types may 
have the objective of maximizing energy production, 

10 while others have the objective of maximizing the 
production of a particular hormone f extracellular 
matrix component, or a mechanical property such as 
contractile force. In cases where cell reproduction is 
slow, such as human skeletal muscle, growth and its 

15 effects need not be taken into account- In other 
cases, biomass composition and growth rate could be 
incorporated into a "maintenance" type of flux, where 
rather than optimizing for growth, production of 
precursors is set at a level consistent with 

20 experimental knowledge and a different objective is 
optimized. 

Certain cell types, including cancer cells, 
can be viewed as having an objective of maximizing cell 
growth. Growth can be defined in terms of biosynthetic 

25 requirements based on literature values of biomass 

composition or experimentally determined values such as 
those obtained as described above. Thus, biomass 
generation can be defined as an exchange reaction that 
removes intermediate metabolites in the appropriate 

30 ratios and represented as an objective function. In 
addition to draining intermediate metabolites this 
reaction flux can be formed to utilize energy molecules 
such as ATP, NADH and NADPH so as to incorporate any 
maintenance requirement that must be met. This new 

35 reaction flux then becomes another constraint/balance 
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equation that the system must satisfy as the objective 
function. Using the stoichiometric matrix of Figure 3 
as an example, adding such a constraint is analogous to 
adding the additional column V growth to the 
5 stoichiometric matrix to represent fluxes to describe 
the production demands placed on the metabolic system. 
Setting this new flux as the objective function and 
asking the system to maximize the value of this flux 
for a given set of constraints on all the other fluxes 
10 is then a method to simulate the growth of the 
organism. 

Continuing with the example of the 
stoichiometric matrix applying a constraint set to a 
reaction network data structure can be illustrated as 
15 follows. The solution to equation 5 can be formulated 
as an optimization problem, in which the flux 
distribution that minimizes a particular objective is 
found. Mathematically, this optimization problem can 
be stated as: ^ 

20 Minimize Z (Eq. 6) 

where z = £ C. - v, (Eq. 7 ) 

where Z is the objective which is represented as a 
25 linear combination of metabolic fluxes v* using the 
weights Ci in this linear combination. The 
optimization problem can also be stated as the 
equivalent maximization problem; i.e. by changing the 
sign on Z. Any commands for solving the optimazation 
30 problem can be used including, for example, linear 
programming commands . 



WO 03/082214 PCT/US03/09751 

50 

A computer system of the invention can 
further include a user interface capable of receiving a 
representation of one or more reactions, A user 
interface of the invention can also be capable of 
sending at least one command for modifying the data 
structure, the constraint set or the commands for 
applying the constraint set to the data representation, 
or a combination thereof. The interface can be a 
graphic user interface having graphical means for 
making selections such as menus or dialog boxes. The 
interface can be arranged with layered screens 
accessible by making selections from a main screen. 
The user interface can provide access to other 
databases useful in the invention such as a metabolic 
reaction database or links to other databases having 
information relevant to the reactions or reactants in 
the reaction network data structure or to Homo sapiens 
physiology. Also, the user interface can display a 
graphical representation of a reaction network or the 
results of a simulation using a model of the invention. 

Once an initial reaction network data 
structure and set of .constraints has been created, this 
model can be tested by preliminary simulation. During 
preliminary simulation, gaps in the network or 
"dead-ends" in which a metabolite can be produced but 
not consumed or where a metabolite can be consumed but 
not produced can be identified. Based on the results 
of preliminary simulations areas of the metabolic 
reconstruction that require an additional reaction can 
be identified. The determination of these gaps can be 
readily calculated through appropriate queries of the 
reaction network data structure and need not require 
the use of simulation strategies, however, simulation 
would be an alternative approach to locating such gaps. 
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In the preliminary simulation testing and 
model content refinement stage the existing model is 
subjected to a series of functional tests to determine 
if it can perform basic requirements such as the o 
5 ability to produce the required biomass constituents 
and- generate predictions concerning the basic 
physiological characteristics of the particular cell 
type being modeled. The more preliminary testing that 
is conducted the higher the quality of the model that 
10 will be generated- Typically, the majority of the 

simulations used in this stage of development will be 
single optimizations. A single optimization can be 
used to calculate a single flux distribution 
demonstrating how metabolic resources are routed 
15 determined from the solution to one optimization 

problem. An optimization problem can be solved using 
linear programming as demonstrated in the Examples 
below. The result can be viewed as a display of a flux 
distribution on a reaction map. Temporary reactions 
20 can be added to the network to determine if they should 
be included into the model based on modeling/simulation 
requirements . 

Once a model of the invention is sufficiently 
complete with respect to the content of the reaction 

25 network data structure according to the criteria set 

forth above, the model can be used to simulate activity 
of one or more reactions in a reaction network. The 
results of a simulation can be displayed in a variety 
of formats including, for example, a table, graph, 

30 reaction network, flux distribution map or a phenotypic 
phase plane graph. 



Thus, the invention provides a method for 
predicting a Homo sapiens physiological function. The 
method includes the steps of (a) providing a data 
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structure relating a plurality of Homo sapiens 
reactants to a plurality of Homo sapiens reactions, 
wherein each of the Homo sapiens reactions includes a 
reactant identified as a substrate of the reaction, a 
5 reactant identified as a product of the reaction and a 
stoichiometric coefficient relating said substrate and 
said product; (b) providing a constraint set for the 
plurality of Homo sapiens reactions; (c) providing an 
objective function, and (d) determining at least one 
10 flux distribution that minimizes or maximizes the 

objective function when the constraint set is applied 
to the data structure, thereby predicting a Homo 
sapiens physiological function. 

A method for predicting a Homo sapiens 
15 physiological function can include the steps of (a) 

providing a data structure relating a plurality of Homo 
. sapiens reactants to a plurality of Homo sapiens 
reactions, wherein each of the Homo sapiens reactions 
includes a reactant identified as a substrate of the 
20 reaction, a reactant identified as a product of the 

reaction and a stoichiometric coefficient relating the 
substrate and the product, and wherein at least one of 
the reactions is a regulated reaction; (b) providing a 
constraint set for the plurality of reactions, wherein 
25 the constraint set includes a variable constraint for 
the regulated reaction; (c) providing a 
condition-dependent value to the variable constraint; 
(d) providing an objective function, and (e) 
determining at least one flux distribution that 
30 minimizes or maximizes the objective function when the 
constraint set is applied to the data structure, 
thereby predicting a Homo sapiens physiological 
function. 
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As used herein, the term "physiological 
function/' when used in reference to Homo sapiens r is 
intended to mean an activity of a Homo sapiens cell as 
a whole. An activity included in the term can be the 
5 magnitude or rate of a change from an initial state of 
a Homo sapiens cell to a final state of the Homo 
sapiens cell. An activity included in the term can be, 
for example, growth, energy production, redox 
equivalent production, biomass production, development, 
10 or consumption of carbon nitrogen, sulfur, phosphate, 
hydrogen or oxygen. An activity can also be an output 
of a particular reaction that is determined or 
predicted in the context of substantially all of the 
reactions that affect the particular reaction in a Homo 
15 sapiens cell or substantially all of the reactions that 
occur in a Homo sapiens cell (e.g. muscle contraction). 
Examples of a particular reaction included in the term 
are production of biomass precursors, production of a 
protein, production of an amino acid, production of a 
20 purine, production of a pyrimidine, production of a 
lipid, production of a fatty acid, production of a 
cof actor or transport of a metabolite. A physiological 
function can include an emergent property which emerges 
from the whole but not from the sum of parts where the 
25 parts are observed in isolation (see for example, 
Palsson, Nat. Biotech 18 : 1147-1150 - (2000) ) . 

A physiological function of Homo sapiens 
reactions can be determined using phase plane analysis 
of flux distributions. Phase planes are 
30 representations of the feasible set which can be 

presented in two or three dimensions. As an example, 
two parameters that describe the growth conditions such 
as substrate and oxygen uptake rates can be defined as 
two axes of a two-dimensional space. The optimal flux 



WO 03/082214 

54 



PCT/USD3 /PCTAJS03/09751 



distribution can be calculated from a reaction network 
data structure and a set of constraints as set forth 
above for all points in this plane by repeatedly 
solving the linear programming problem while adjusting 
5 the exchange fluxes defining the two-dimensional space. 
A finite number of qualitatively different metabolic 
pathway utilization patterns can be identified in such 
a plane, and lines can be drawn to demarcate these 
regions. The demarcations defining the regions can be 
10 determined using shadow prices of linear optimization 
as described, for example inChvatal, Linear 
^rogrMmm New York, W.H. Freeman and Co. (1983). The 
regions are referred to as regions of constant shadow 
price structure. The shadow prices define the 
15 intrinsic value of each reactant toward the objective 
function as a nunber that is either negative, zero, or 
positive and are graphed according to the uptake rates 
represented by the x and y axes. When the shadow 
prices become zero as the value of the uptake rates are 
20 changed there is a qualitative shift in the optimal 
reaction network. 

One demarcation line in the phenotype phase 
plane is defined as the line of optimally (LO) . This 
line represents the optimal relation between respective 
25 metabolic fluxes. The LO can be- identified by varying 
the x-axis flux and calculating the optimal y-axis flux 
with the objective function defined as the growth flux 

From the phenotype phase plane analysis the 
conditions under which a desired activity is optimal 
30 can be determined. The maximal uptake rates lead to 

the definition of a finite area of the plot that is the 
predicted outcome of a reaction network within the 
environmental conditions represented by the constraint 
set Similar analyses can be performed in multiple 
35 dimensions where each dimension on the plot corresponds 



WO 03/082214 PCX /" U S O 3 /PCT/US03/09751 

55 

to a different uptake rate. These and other methods 
for using phase plane analysis, such as those described 
in Edwards et al., nintech Bioena. 77:27-36(2002), can 
be used to analyze the results of a simulation using an 
5 in silico Homo sapiens model of the invention. 

A physiological function of Homo sapiens can 
also be determined using a reaction map to display a 
flux distribution. A reaction map of Homo sapiens can 
be used to view reaction networks at a variety of 

10 levels. In the case of a cellular metabolic reaction 
network a reaction map can contain the entire reaction 
complement representing a global perspective. 
Alternatively, a reaction map can focus on a particular 
region of metabolism such as a region corresponding to 

15 a reaction subsystem described above or even on an 
individual pathway or reaction. 

Thus, the invention provides an apparatus 
that produces a representation of a Homo sapiens 
physiological function, wherein the representation is 
20 produced by a process including the steps of: (a) 

providing a data structure relating a plurality of Homo 
sapiens reactants to a plurality of Homo sapiens 
reactions, wherein each of the Homo sapiens reactions 
includes a reactant identified as a substrate of the 
25 reaction, a reactant identified as a product of the 

reaction and a stoichiometric coefficient relating said 
substrate and said product; (b) providing a constraint 
set % -for the plurality of Homo sapiens reactions; (c) 
providing an objective function; (d) determining at 
30 least one flux distribution that minimizes or maximizes 
the objective function when the constraint set is 
applied to the data structure, thereby predicting a 
Homo sapiens physiological function, and (e) producing 
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a 



representation of the activity of the one or 



more 



Homo sapiens reactions. 

The methods of the invention can be used to 
determine the activity of a plurality of Homo sapiens 

5 reactions including, for example, biosynthesis of an 
amino acid, degradation of an amino acid, biosynthesis 
of a purine, biosynthesis of a pyrimidine, biosynthesis 
of a lipid, metabolism of a fatty acid, biosynthesis of 
a cofactor, transport of a metabolite and metabolism of 

10 an alternative carbon source. In addition, the methods 
can be used to determine the activity of one or more of 
the reactions described above or listed in Table 1. 

The methods of the invention can be used to 
determine a phenotype of a Homo sapiens mutant. The 
15 activity of one or more Homo sapiens reactions can be 
determined using the methods described above, wherein 
the reaction network data structure lacks one or more 
gene-associated reactions that occur in Homo sapiens. 
Alternatively, the methods can be used to determine the 
20 activity of one or more Homo sapiens reactions when a 
reaction that does not naturally occur in Homo sapiens 
is added to the reaction network data structure. 
Deletion of a gene can also be represented in a model 
of the invention by constraining the flux through the 
25 reaction to zero, thereby allowing the reaction to 
remain within the data structure. Thus, simulations 
can be made to predict the effects of adding or 
removing genes to or from Homo sapiens. The methods 
can be particularly useful for determining the effects 
30 of adding or deleting a gene that encodes for a gene 
product that performs a reaction in a peripheral 
metabolic pathway. 
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A drug target or target for any other agent 
that affects Homo sapiens function can be predicted 
using the methods of the invention. Such predictions 
can be made by removing a reaction to simulate total 
5 inhibition or prevention by a drug or agent. 

Alternatively, partial inhibition or reduction in the 
activity a particular reaction can be predicted by 
performing the methods with altered constraints. For 
example, reduced activity can be introduced into a 
10 model of the invention by altering the a 3 or b 3 values 
for the metabolic flux vector of a target reaction to 
reflect a finite maximum or minimum flux value 
corresponding to the level of inhibition.' Similarly, 
the effects of activating a reaction, by initiating or 
15 increasing the activity of the reaction, can be 

predicted by performing the methods with a reaction 
network data structure lacking a particular reaction or 
by altering the a 3 or b 5 values for the metabolic flux 
vector of a target reaction to reflect a maximum or 
20 minimum flux value corresponding to the level of 

activation. The methods can be particularly useful for 
identifying a target in a peripheral metabolic pathway. 

Once a reaction has been identified for which 
activation or inhibition produces a desired effect on 

25 Homo sapiens function, an enzyme or macromolecule that 
performs the reaction in Homo sapiens or a gene that 
expresses the enzyme or macromolecule can be identified 
as a target for a drug or other agent. A candidate 
compound for a target identified by the methods of the 

30 invention can be isolated or synthesized using known 
methods. Such methods for isolating or synthesizing 
compounds can include, for example, rational design 
based on known properties of the target (see, for 
example, DeCamp et al., Protein Engineeri ng Principles 

35 and Practice , Ed. Cleland and Craik, Wiley-Liss, New 
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York, pp. 467-506 (1996)), screening the target against 
combinatorial libraries of compounds (see for example, 
Houghten et al., Nature, 354, 84-86 (1991); Dooley et 
al., Science, 266, 2019-2022 (1994), which describe an 
5 iterative approach, or R. Houghten et al. 

PCT/US91/08694 and U.S. Patent 5,556,762 which describe 
the positional-scanning approach) , or a combination of 
both to obtain focused libraries. Those skilled in the 
art will know or will be able to routinely determine 
10 assay conditions to be used in a screen based on 

properties of the target or activity assays known in 
the art. 

A candidate drug or agent, whether identified 
by the methods described above or by other methods 
15 known in the art, can be validated using an in silico 
Homo sapiens model or method of the invention. The 
effect of a candidate drug, or agent on Homo sapiens 
physiological function can be predicted based on the 
activity for a target in the presence of the candidate 
20 drug or agent measured in vitro or in vivo. This 
activity can be represented in an in silico Homo 
sapiens model by adding a reaction, to the model, 
removing a reaction from the model or adjusting a 
constraint for a reaction in the model to reflect the 
25 measured effect of the candidate drug or agent on the 
activity of the reaction. By running a simulation , 
under these conditions the holistic effect of the 
candidate drug or agent on Homo sapiens physiological 
function can be predicted. 

30 The methods of the invention can be used to 

determine the effects of one or more environmental 
components or conditions on an activity of a Homo 
sapiens cell. As set forth above an exchange reaction 
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can be added to a reaction network data structure 
corresponding to uptake of an environmental component, 
release of a component to the environment, or other 
environmental demand. The effect of the environmental 
5 component or condition can be further investigated by 
running simulations with adjusted a 3 or b 3 values for 
the metabolic flux vector of the exchange reaction 
target reaction to reflect a finite maximum or minimum 
flux value corresponding to the effect of the 
10 environmental component or condition. The 

environmental component can be, for example an 
alternative carbon source or a metabolite that when 
added to the environment of a Homo sapiens cell can be 
taken up and metabolized. The environmental component 
15 can also be a combination of components present for 
example in a minimal medium composition. Thus, the 
methods can be used to determine an optimal or minimal 
medium composition that is capable of supporting a 
particular activity of Homo sapiens. 

20 The invention further provides a method for 

determining a set of environmental components to 
achieve a desired activity for Homo' sapiens. The 
method includes the steps of (a) providing a data 
structure relating a plurality of Homo sapiens 
25 reactants to a plurality of Homo sapiens reactions, 

wherein each of the Homo sapiens reactions includes a 
reactant identified as a substrate of the reaction, a 
reactant identified as a product of the reaction and a 
stoichiometric coefficient relating the substrate and 
30 the product; (b) providing a constraint set for the 
.plurality of Hojno sapiens reactions; (c) applying the 
constraint set to the data representation, thereby 
determining the activity of one or more Homo sapiens 
reactions (d) determining the activity of one or more 
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Homo sapiens reactions according to steps (a) through 
(c) , wherein the constraint set includes an upper or 
lower bound on the amount of an environmental component 
and (e) repeating steps (a) through (c) with a changed 
5 constraint set, wherein the activity determined in step 
(e) is improved compared to the activity determined in 



step (d) . 



The following examples are intended to 
illustrate but not limit the present invention. 



10 



This example shows the construction of a 
universal Homo sapiens metabolic reaction database, a 
Homo sapiens core metabolic reaction database and a . 
Homo sapiens muscle cell metabolic reaction database. 
15 This example also shows the iterative model building 
process used to generate a Homo sapiens core metabolic 
mo del and a Homo sapiens muscle cell metabolic model. 

A universal Homo sapiens reaction database 
was prepared from the genome databases and biochemical 
20 literature. The reaction database shown in Table 1 
contains the ' following information: 

Locus ID - the locus number of the gene found 

in the LocusLink website. 

Gene Ab. - various abbreviations which are 

25 used for the gene. 

Reaction Stoichiometry - includes all 
metabolites and direction of the reaction, as well as 

reversibility- 

E.C. - The Enzyme Commission number. 
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Additional information included in the 
universal reaction database*, although not shown in 
Table 1, included the chapter of Salway, supra (1999), 
where relevant reactions were found; the cellular 
5 location, if the reaction primarily occurs in a given 
compartment; the SWISS PROT identifier, which can be 
used to locate the gene record in SWISS PROT; the full 
name of the gene at the given locus; the chromosomal 
location of the gene; the Mendelian Inheritance in Man 
10 (MIM) data associated with the gene; and the tissue 
type, if the gene is primarily expressed in a certain 
tissue. Overall, 1130 metabolic enzyme- or 
transporter-encoding genes were included in the 
universal reaction database. 



15 Fifty-nine reactions in the universal 

reaction database were identified and included based on 
biological data as found in Salway supra (1999), 
currently without genome annotation. Ten additional 
reactions, not described in the biochemical literature 

20 or genome annotation, were subsequently included in the 
reaction database following preliminary simulation ' 
testing and model content refinement. These 69 
reactions are shown at the end of Table 1. 

From the universal Homo sapiens reaction 
25 database shown in Table 1, a core metabolic reaction 

database was established, which included core metabolic 
reactions as well as some amino acid and fatty acid 
metabolic reactions, as described in Chapters 1, 3, 4, 
7, 9, 10, 13, 17, 18 and 44 of J.G. Salway, Metabolism 
30 at. a Glance , 2 nd ed. , Blackwell Science, Maiden, MA 

(1999). The core metabolic reaction database included 
211 unique reactions, accounting for 737 genes in the 
Homo sapiens genome. The core metabolic reaction 
database was used, although not in its entirety, to 
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create the core 
II. 



metabolic model described in Example 



To allow for the modeling of muscle cells, 
the core reaction database was expanded to include 446 
5 unique reactions, accounting for 889 genes in the Homo 
sapiens genome. This skeletal muscle metabolic 
reaction database was used to create the skeletal 
muscle metabolic model described in Example II- 

Once the core and muscle cell metabolic 
10 reaction databases were compiled, the reactions were 
represented as a metabolic network data structure, or 
"stoichiometric input file." For example, the core 
metabolic network data structure shown in Table 2 
contains 33 reversible reactions, 31 non-reversible 
15 reactions, 97 matrix columns and 52 unique enzymes. 
Each reaction in Table 2 is represented so as to 
indicate the substrate or substrates (a negative 
number) and the product or products (a positive 
number) ; the stoichiometry; the name of each reactron 
20 (the term following the zero) ; and whether the reaction 
is reversible (an R following the reaction name) . A 
metabolite that appears in the mitochondria is 
indicated by an •»,- and a metabolite that appears in 
the extracellular space is indicated by an "ex." 



25 



To perform a preliminary simulation or to 
simulate a physiological condition, a set of inputs and 
outputs has to be defined and the network objectrve 
■ function specified. To calculate the maximum ATP 
production of the Homo sapiens core metabolic network 
30 using glucose as a carbon source, a non-zero uptake 
value for glucose was assigned and ATP production was 
maximized as the objective function, using the 
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representation shown in Table 2. The network's 
performance was examined by optimizing for the. given 
objective function and the set of constraints defined 
in the input file, using flux balance analysis methods. 
5 The model was refined in an iterative manner by 
examining the results of the simulation and 
implementing the appropriate changes. 

Using this iterative procedure, two metabolic 
reaction networks were generated, representing human 
10 core metabolism and human skeletal muscle cell 
metabolism. 



This example shows how human metabolism can 
be accurately simulated using a Homo sapiens core 
15 metabolic model. 

The human core metabolic reaction database 
shown in Table 3 was used in simulations of human core 
metabolism. This reaction database contains a total of 
20 65 reactions, covering the classic biochemical pathways 
of glycolysis, the pentose phosphate pathway, the 
tricitric acid cycle, oxidative phosphorylation, 
glycogen storage, the malate/aspartate shuttle, the 
glycerol phosphate shuttle, and plasma and 
25 mitochondrial membrane transporters. The reaction 
network was divided into three compartments: the 
cytosol, mitochondria, and the extracellular space. 
The total number of metabolites in the network is 50, 
of which 35 also appear in the mitochondria. This core 
30 metabolic network accounts for 250 human genes. 
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To perform simulations using the core 
metabolic network, network properties such as the P/O 
ratio were specified using Salway, sujpra (1999) as a 
reference. Oxidation of NADH through the Electron 
5 Transport System (ETS) was set to generate 2.5 ATP 

molecules (i.e. a P/O ratio of 2.5 for NADH) , and that 
of FADH 2 was set to 1.5 ATP molecules (i.e. a P/O ratio 
of 1-5 for FADH 2 ) . 

Using the core metabolic network, aerobic and 
10 anaerobic metabolisms were simulated in silico. 

Secretion of metabolic by-products was in agreement 
with the known physiological parameters. Maximum yield 
of all 12 precursor-metabolites. ( glucose- 6-phosphate, 
fructose-6-phosphate, ribose-5-phosphate, 
15 erythrose-4-phosphate, triose phosphate, 

3-phosphoglycerate, phosphoenolpyruvate, pyruvate, 
acetyl CoA, a-ketoglutarate, succinyl CoA, and 
oxaloacetate) was examined and none found to exceed the 
values of its theoretical yield. 



20 



Maximum ATP yield was also examined in the 
cytosol and mitochondria. Salway, supr^ (1999) reports 
that in the absence of membrane proton-coupled 
• .transport systems," the energy yield is 38 ATP molecules 
per molecule of glucose and otherwise 31 ATP molecules 
25 per molecule of glucose. The core metabolic model 
demonstrated the same values as described by Salway 
supra (1999) . Energy yield in the mitochondria was 
determined to be 38 molecules of ATP per glucose 
molecule. This is equivalent to production of energy 
30 in the absence of proton-couple transporters across 
mitochondrial membrane since all the protons were 
utilized only in oxidative phosphorylation. In the 
cytosol, energy yield was calculated to be 30.5 
molecules of ATP per glucose molecule. This value 
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reflects the cost of metabolite exchange across the 
mitochondrial membrane as described by SajLway, supra 
(1999). 



EXAMPLE III 



5 This example shows how human muscle cell 

metabolism can be accurately simulated under various 
physiological and pathological conditions using a Homo 
sapiens muscle cell metabolic model. 

As described in Example I, the core metabolic 
10 model was extended to also include all the major 

reactions occurring in the skeletal muscle cell, adding 
new functions to the classical metabolic pathways found 
in the core model, such as fatty acid synthesis and 
p-oxidation, triacylglycerol and phospholipid 
15 formation, and amino acid metabolism. Simulations were 
performed using the muscle cell reaction database shown 
in Table 4. The biochemical reactions were again 
compartmentalized into cytosolic and mitochondrial 
compartments . 

20 To simulate physiological behavior of human 

skeletal muscle cells, an objective function had to be 
defined. Growth of muscle cells occurs in time scales 
of several hours to days. The time scale of interest 
in the simulation, however, was in the order of several 

25 to tens of minutes, reflecting the time period of 

metabolic changes during exercise. Thus, contraction 
(defined as, and related to energy production) was 
chosen to be the objective function, and no ■ additional 
constraints were imposed to represent growth demands in 

30 the cell. 
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To study and test the behavior of the 
network, twelve physiological cases (Table 5) and five 
disease cases (Table 6) were examined. The input and 
output of metabolites were specified as indicated in 
Table 5, and maximum energy production and metabolite 
secretions were calculated and taken into account. 



Table 5 



Metabolite 



10 



11 



12 
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Table 6 



67 



Disease* 


Enzyme Deficiency 


Reaction 
Constrained 


McArdle's disease 


phosphorylase 


GBE1 


Tarui's disease 


pnospnoiructoju.an&c 


PFKL 


Phosphoglycerate 
kinase deficiency 


phosphoglycerate 
kinase 


PGK1R 


Phosphoglycerate 
mutase deficiency 


phosphoglycerate 
mutase 


PGAM3R 


Lactate 

dehydrogenase 

deficiency 


Lactate dehyrogenase 


LDHAR 



10 



The skeletal muscle model was tested for 
utilization of various carbon sources available during 
various stages of exercise and food starvation (Table 
15 5) . The by-product secretion of the network in an 

aerobic to anaerobic shift was qualitatively compared 
to physiological outcome of exercise and found to 
exhibit the same general features such as secretion of 
- fermentative by-products and lowered energy yield. 



20 The network behavior was also examined for 

five disease cases (Table 6) . The test cases were 
chosen based on their physiological relevance to the 
model's predictive capabilities. In brief, McArdle's 
disease is marked by the impairment of glycogen 

25 breakdown. Tarui's disease is characterized by a 
deficiency in phosphof ructokinase . The remaining 
diseases examined are marked by a deficiency of 
metabolic enzymes phosphoglycerate kinase, 
phosphoglycerate mutase, and lactate dehydrogenase. In 

30 each case, the changes in flux and by-product secretion 
of metabolites were examined for an aerobic to 
anaerobic metabolic shift with glycogen and 
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phosphocreatine as the sole carbon sources to the 
network and pyruvate, lactate, and albumin as the only 
metabolic by-products allowed to leave the system. To 
• simulate the disease cases, the corresponding deficient 
5 enzyme was constrained to zero. In all cases, a severe 
reduction in energy production was demonstrated during 
exercise, representing the state of the disease as seen 
in clinical cases. 

Throughout this application various 
10 publications have been referenced. The disclosures of 
these publications in their entireties are hereby 
incorporated by reference in this application in order 
to more fully describe the state of the art to which 
this invention pertains. 

15 Although the invention has been described 

with reference to the examples provided above, it 
should be understood that various modifications can be 
made without departing from the spirit of the 
invention. Accordingly, the invention is limited only 

20 by the claims. 
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Table! 

Locus ID Gene Ab. Reaction Stolchiometry E.C. 

1. Carbohydrate Metabolism 

1 .1 Glycolysis / Gluooneogenesis [PATH:hsa00010] 



ssm HK1 


GLC + ATP -> G6P + ADP 


2.7.1.1 


3293 HK2 


GLC + ATP -> G5P + ADP 


2.7.1.1 


3101 HK3 


GLC + ATP -> G6P + ADP 


2.7.1.1 


2645 GCK, HK4, MODY2, NIDDM 


GLC + ATP -> G6P + ADP 


.2.7. 1.2 


2535 G6PC, G6PT 


G6P + H20->GLC + PI 


3.1.3.9 


2621 GPI 


G5P <-> F6P 


5.3.1.9 


5211 PFKL 


F6P + ATP->FDP + ADP 


2-7.1.11 


5213 PFKM 


F6P+ATP->FDP + ADP 


2.7.1.11 


5214 PFKP, PFK-C 


F6P + ATP->FDP + ADP 


2.7.1.11 


5215 PFKX 


F6P + ATP->FDP + ADP 


2-7-1-11 


2203 FBP1, FBP 


FDP + H20 -> FBP + PI 


3.1.3.11 


87B9 FBP2 


FDP + H20 -> F6P + PI 


3.1.3.11 


226 ALDOA 


FDP <-> T3P2 + T3P1 


4.1.2.13 


229 ALDOB 


FDP <-> T3P2 + T3P1 


4.12.13 


230 ALDOC 


FDP <-> T3P2 + T3P1 


4.1.2.13 


7167 TPI1 


T3P2 <-> T3P1 


5.3.1.1 


2532 GAPD, GAPDH 


T3P1 + PI + NAD <-> NADH + 13PDG 


1.2.1.12 


26330 GAPDS, GAPDH-2 


T3P1 + PI + NAD <-> NADH + 13PDG 


1.2,1,12 


5230 PGK1 , RGKA 


13PDG + ADP <-> 3PG + ATP 


2.7.2.3 


5233 PGK2 v 


13PDG + ADP <-> 3PG + ATP 


2.7.2.3 


5223 PGAM1, PGAMA 


13PDG->23PDG 


5.4.2.4 




23PDG + H20 -> 3PG + PI 


3.1.3.13 




3PG <-> 2PG 


5.4.2.1 


5224 PGAM2, PGAMM 


13PDG <-> 23PDG 


54.2.4 




23PDG + H20 -> 3PG + PI 


3.1.3.13 




3PG <-> 2PG 


5.4.2.1 


£69 BPGM 


13PDG<->23PDG 


5.42.4 




23PDG + H20 <-> 3PG + PI 


3.1,3.13 




3PG <-> 2PG 


5.42.1 


2023 EN01,PPH, EN01L1 


2PG <~> PEP + H20 


4.2.1.11 


2026 EN02 


2PG <-> PEP + H20 


4,2.1.11 


2027 EN03 


2PG <-> PEP + H20 


4.2.1.11 


26232 EN01B 


2PG <-> PEP + H20 


4.2.1.11 


5313 PKLR, PK1 


PEP + ADP -> PYR + ATP 


2.7.1.40 


5315 PKM2. PK3, THBP1, OIP3 


PEP + ADP -> PYR + ATP 


2.7.1.40 


5160 PDHA1, PHE1A, PDHA 


PYRm + COAm + NADm -> + NADHm + CQ2m + ACCOAm 


1.2.4,1 


5161 PDHA2, PDHAL 


PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 


1.2.4.1 


5162 PDHB 


PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 


1.2.4.1 


1232 DLAT, DLTA, PDG-E2 


PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 


2.3.1.12 


6Q5Q PDX1.E3BP 


PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 


2.3.1.12 


3S39 LDHA, LDH1 


NAD + LAC <-> PYR + -NADH 


1.1.127 


3945 LDHB 


NAD + LAC <-> PYR + NADH 


1.1.1.27 


3948 LDHC, LDH3 


NAD + LAC <-> PYR + NADH 


1.1.127 


5236 PGM1 


G1P <-> GBP 


5.4.2.2 


5237 PGM2 


G1P <-> GBP 


5.4.2.2 


5233 PGM3 


G1P<->G6P 


5.4.2.2 


1738 DLD, LAD, PHE3, DLDH, E3 


DLIPOm + FADm <-> UPOm + FADH2m 


1.8.1.4 


124 ADH1 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


125 ADH2 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


126 ADH3 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


12Z ADH4 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


12B ADH5 


FALD + RGT + NAD <-> FGT + NADH 


1.2.1.1 




ETH + NAD <-> ACAL+ NADH 


1.1.1.1 


12© ADH6 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


131 ADH7 


ETH + NAD <-> ACAL + NADH 


1.1.1.1 


10327 AKR1A1, ALR, ALOR1 




1.1.1.2 


32 ACYP1 




3.6.1.7 


SB ACYP2 




3.6.1.7 


1 2 Citrate cycle (TCA cycle) PAtH:hsa00020 






1431 CS 


ACCOAm + OAm + H20m -> COAm + CITm 


4 1.3.7 


46 AC01,1REB1,IRP1 


cit<->icit 


42.1.3 


5Q AC02 


CITm <-> ICTTm 


4.2.1.3 


3412 IDH1 


ICIT + NADP -> NADPH + C02 + AKG 


U.1.42 
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341A1DH2 

3419 IDH3A 

3420 IDH3B 

3421 IDH3G 
4S5IOGDH 
1743 DLST, DLTS 
fiBQ2 SUCLG1, SUCLA1 
BB03 SUCLA2 

2221 FH 
4190 MDH1 
4J51MDH2 
soai PC, PCB 

4ZACLY,ATPCL,CLATP 
3B5Z 

PCK2, PEPCK 
1.3 Pentose phosphate cycle PATH:hsaD003Q 
2523 G6PD, G6PD1 
2K3H6PD 

25ZS5 PGLS, 6PGL 
522&PGD 
6120 RPE- 
J086 TKT 



g22Z TKTL1 , TKR, TKT2 



ICITm + NADPm -> NADPHm + C02m + AKGm 

ICITm + NADm -> C02m + NADHm + AKGm 

ICITm + NADm -> C02m + NADHm + AKGm 

ICITm + NADm -> CQ2m + NADHm + AKGm 

ST Gm + NADm + COAm -> C02m ♦ NADHm + SUCCOAm 

AKGm + NADm ♦ COAm -> C02m ♦ NADHm + SUCCOAm 

GTPm + SUCCm + COAm <-> GDPm + Pirn + SUCCOAm 

ATPm + SUCCm + COAm ~ ADPm + Pirn + SUCCOAm 

FUMm + H20m <-> MALm 

MAL + NAD <-> NADH + OA 

MALm + NADm <-> NADHm + OAm 

pYRm + ATPm + C02m -> ADPm + OAm + Pim 

ATP + CIT + COA + H20 -> ADP + PI + ACCOA + OA 

OA + GTP -> PEP + GDP + C02 

OAm + GTPm -> PEPm + GDPm + C02m 

G6P + NADP <-> D6PGL + NADPH 

D6PGL + H20 -> D6PGC 
D6PGL+H20->D6PGC 
D6PGC + NADP -> NADPH + C02 + RL5P 
RL5P <-> X5P 
R5P + X5P <-> T3P1 + S7P 
X5P + E4P <-> F6P + T3P1 
R5P + X5P<->T3P1+S7P 
X5P + E4P<->F6P+T3P1 
T3P1+S7P<->E4P + F6P 
R5P + ATP <-> PRPP + AMP 
R5P + ATP <-> PRPP + AMP 



eaasTALDoi 

5631 PRPS1 , PRS I, PRS, I 
5S24PRPS2, PRS II, PRS, II 

-1 4 Pentose and gluconate Interconversions PATH:hsaO0O40 

221 AKR1B1, AR, ALDR1. ADR ^ uopG + ppJ 

1359 UGP1 UDPG + P p, 

23SQ UGP2, UGPP2 61P + U1 p 

7358 UGDH, UDPGDH 
10720 UGT2B11 

UGT1A1, UGT1A, GNT1, UGT1 
7361 UGT1A, UGT1, UGT1A 
2362 UGT2B. UGT2, UGT2B 
J362 UGT2B4, UGT2B11 
7364 UGT2B7, UGT2B9 
7365UGT2B10 
J256UGT2B15, UGT2B8 
7367 UGT2B17 

13AADAC, DAC 
2921 LIPE, LHS, HSL 
1 5 Fructose and mannose metabolism PATH:hsa0005l 

4351MPI.PMM iml 



SZ2PMM1 
52Z3PMM2, CDG1.CDGS 
2Z62GMDS 
8790 FPGT, GFPP 
52DIPFKFB1,PFRX 

52Qfi PFKFB2 

52Q9 PFKFB3 

521QPFKFB4 

2Z35KHK 
6652 SORD 

2525 FUT4, FCT3A, FUOTIV 
2529 FUT7 
3026 HAS1 , HAS 
202ZHAS2 



MAN6P <-> F6P 
MAN6P <-> MAN1P 
MAN6P <-> MAN1P 



ATP + F6P->ADP + F26P 

F26P -> F6P + PI 

ATP + F6P->ADP + F26P 

F26P -> F6P + PI 

ATP + F6P -> ADP + F26P 

F26P->F6P + PI 

ATP + F6P->ADP + F26P 

F26P -> F6P + PI 

DSOT + NAD ->FRU + NADH 



1 .1.1.42 

1.1.1.41 

.1,1.1.41 

1-1-1.41 

12.4.2 

2.3.1.61 

6.2.1.4 

6 2.1.4 

4^X2 

1.1.1.37 
1-1-1.37 
fi.4.1.1 
4 1-3.8 

4 1.1.32 
41 1.32 

1 1-1-49 
1.1.1.47 
3.1.1.31. 
3 1.1.31 

1 1-1.44 
5.1,3d 
? 2.1.1 

2 2.1.1 
22X2- 

9.7.6.1 
27.6.1 
1.1.1.47 

1 1-1-21 
27.7.9 
27.7.9 
XLX22. 
7.4.1.17. 
2,4,1.17. 

2 4.1.17 
94.1.17 

2,4.1.17 

2.4.1.17. 

2.4.1 .17. 

2.4.1.17- 

7A1.17 

3.1 .1.-- 

3X1= 

5-3.1. B 
S . 42.S 

5A2.fi 

L 9.1.47 
9 7-7.30 
9 7.1.105 



2, 7-1.105 
3 1-3.46 
?7 1.105 
? , 1-3.46 
9 7.1.105 
3.1.3.46 
9,7.1.3 
1 , 1.1.14 
2,4-1. - 
2 f 4.1.• 
2Xlr 

2. 4.1 
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£4Z3 OGT, O-GLCNAC 
51144 LOC51 144 
1.6 Galactose metabolism PATH:hsa00O52 

25S4GALK1.GALK GLAC + ATP -> GAL1P + ADP 

2585 GALK2, GK2 GLAC + ATP -> GAL1P + ADP 

2592 GALT UTP + GAL1P <-> PPI + UDPGAL 

2582 GALE UDPGAL <-> UDPG 

2Z2DGLB1 
3938 LCT, LAC 

™, B4GALT1, GGTB2, BETA4GAL-T1, 
^GTI.GTB 



3225 LALBA 
2Z12GLA, GALA 
254 8 GAA 

2534 GANAB ' 

2525 GANC 

89Z2 MGAM, MG, MGA 



MEL! -> GLC + GLAC 
MLT -> 2 GLC 
6DGLC -> GLAC + GLC 
MLT -> 2 GLC 
6DGLC -> GLAC + GLC 
MLT -> 2 GLC 
6DGLC -> GLAC + GLC 
MLT -> 2 GLC 
6DGLC -> GLAC + GLC 



1.7 Ascorbate and atdarate metabofism PATH±tsa00053 

215 ALDH1, PUMB1 ACAL + NAD -> NADH + AC 

21Z ALDH2 ACALm + NADm •> NADHrn + ACm 

212 ALDH5, ALDHX 
223ALDH9, E3 



22A ALDH10, FALDH, SLS 
SS54RALDH2 
1521CYP24 
1532 CYP25A1, P450RAI 
1593 CYP27A1, CTX, CYP27 

CYP27B1. PDDR, VDD1, VDR, CYP1, 
VDDR, I.P45DC1 
1.8 Pyruvate metabofism PATH :hsa 00620 
54388 FLJ205B1 

31ACACA,ACAC,ACC 



1594 : 



ATP + AC + CO A -> AMP + PPI + ACCOA 
ACCOA + ATP + C02 <-> MALCOA + ADP + PI + H 



32 ACACB, ACCB, HACC275, ACC2 ACCOA + ATP + C02 <-> MALCOA + ADP + PI + H 



2232GL01.GLYI 
2022 HAGH, GL02 
223 FDH 

S36QGRHPR, GLXR 
42QQME2 
1MQME3 
29S2Z HUMNDME 
4122 ME1 

38 ACAT1, ACAT. T2, THIL, MAT 
39ACAT2 

1.9 Glyoxytate and cficarboxytate metabolism PATH:hsaOOS30 
5240 PGP 
2258 GLYD 

10797 MTHFD2, NMDMC 



RGT + MTHGXL <-> LGT 
LGT -> RGT + LAC 
FALD + RGT + NAD <-> FGT + NADH 

MAUn + NADm -> C02m + NADHm + PYRm 
MALm + NADPm -> C02m + NADPHm + PYRm 
MAL + NADP -> C02 + NADPH + PYR 
MAL + NADP -> C02 + NADPH + PYR 
2 ACCOAm <-> COAm + AACCOAm 
2 ACCOAm <-> COAm + AACCOAm 



4S2MTHFD1 



1.10 Propanoate metabofism PATH:hsa00640 
34 ACADM, MCAD 



5SACADSB 



3HPm + NADHm -> NADm + GLYAm 
METHF <-> FTHF 
METTHF + NAD -> METHF + NADH 
METTHF + NADP <-> METHF + NADPH 
METHF <-> FTHF 

THF + FOR + ATP -> ADP + PI + FTHF 

MBCOAm + FADm -> MCCOAm + FADH2m 
IBCOAm + FADm -> MACOAm + FADH2m 
IVCOAm + FADm -> MCRCOAm + FADH2m 
MBCOAm + FADm -> MCCOAm + FADH2m 



2.4.1.- 
1.1.1.- 

2.7.1.6 

2.7.1.6 

2.7.7.10 

5.1.3.2 

3.2.123 

32X52 

22X108 

2.4.1.90 

2.4.1.38 
2.4.1.22 
2X122 
3.2.1.22 
3.2.1.20 

32.1.20 

3.2.1.20 

32X20. 

32X3 

1.2.1.3 

1.2.1.3 

1.2.1.3 

1.2.1.3 

,1.2.1.12 

1.2.1.3 

1.2.1.3 

1.14.-.- 

1.14,-.- 

1.14.-.- 

1.14.-.- 



6.2.1.1 
6.4.1.2 
6.3.4. 14 
6.4.1.2 
6.3.4.14 

4.4.1,5 

3.1.2.6 

12.1.1 

1.1.1.72 

11X38 

1X1.40 

1.1X40 

1.1.1.40 

2.3.1.9 

2.3.1.9 

3.1,3.18 

1.1.1.29 

3.5.4.9 

1.51.15 

X5X15 

3.5A9 

6.3.4,3 

1,3.99.3 



13.99.3 
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Jfia2ECHS1,SCEH 
1962 EHHADH 

3030 HADHA, MTPA, GBP 



23417 MLYCD, MCD 
ifi ABAT, GABAT 
5Q25PCCA 
5Q2£PCCB 
4594 MUT, MCM 
4329 MMSDH 

£523 FACVL1 , VLCS, VLACS 
1.11 Butanoate metaborism PATH."hsa00650 

HADH2, ERAB 

2Q32 HADHSC, SCHAD 
55ACADS. SCAD 

7915 ALDH5A1 , SSADH, SSDH 
2571 GAD1, GAD, GAD67, GAD25 
25Z2GAD2 
2573 GAD3 

3157 HMGCS1, HMGCS 

3158 HMGCS2 
2155HMGCU HL 
5Q1SOXCT 

J522BDH 

lfi22 DBT, BCATE2 



1.13 Inositol metabolism PATH:hsa00031 
2. Energy Metabolism 

2.1 Oxidative phosphorylation PATH:hsa001 90 
4535MTND1 

4536 MTND2 

4537 MTND3 

4538 MTNP4 

4539 MTNID4L 

4540 MTND5 

4541 MTND6 
NDUFA1, MWFE 

4695 NDUFA2, B8 

4696 NDUFA3, B9 

4697 NDUFA4, MLRQ 

4SSa NDUFAS, UQOR13, B13 

4700 NDUFAS, B14 

4701 NDUFA7, Bl45a. B14.5A 
4ZQ2 NDUFA8, PGIV 

4704 NDUFA9, NDUFS2L 
4ZQ5 NDUFA10 



IBCOAm + FADm -> MACOAm + FADH2m 
IVCOAm + FADm -> MCRCOAm + FADH2m 
MACOAm + H20m -> HIBCOAm 
MCCOAm + H20m -> MHVCOAm 
MHVCOAm + NADm -> MAACOAm + NADHm 
HlBm + NADm -> MMAm + NADHm 
MACOAm + H20m -> HIBCOAm 
MCCOAm + H20m ■> MHVCOAm 
MHVCOAm + NADm -> MAACOAm + NADHm 
HlBm + NADm -> MMAm + NADHm 
MACOAm + H20m -> HIBCOAm 
MCCOAm + H20m ■> MHVCOAm 

' C160CARm + CO Am + FADm + NADm -> FADH2m + NADHm + 
C140COAm + ACCOAm 

GABA + AKG -> SUCCSAL + GLU 

PROPCOAm + C02m + ATPm -> AD Pm + Pirn + DMMCOAm 
PROPCOAm + C02m + ATPm -> ADPm + Pirn + DMMCOAm 
LMMCOAm -> SUCCOAm 

MMAm + COAm + NADm -> NADHm + C02m + PROPCOAm 



C140COAm + 7COAm + 7 FADm + 7 NADm -> 7 FADH2m + 7 
NADHm + 7 ACCOAm 

MBCOAm + FADm -> MCCOAm + FADH2m 
IBCOAm + FADm -> MACOAm + FADH2m 

GLU -> GABA + C02 

GLU -> GABA + C02 

GLU -> GABA + C02 

H3MCOA + COA <-> ACCOA + AACCOA 

H3MCOA + COA <-> ACCOA + AACCOA 

H3MCOAm -> ACCOAm + ACTACm 

3HBm + NADm -> NADHm + Hm + ACTACm 
OMVALm + COAm + NADm -> MBCOAm + NADHm + C02m 
OIVALm + COAm + NADm -> IBCOAm + NADHm + C02m 
OICAPm + COAm + NADHm -> IVCOAm + NADHm + C02m 



NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm-> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm +4 Hm -> QH2m + NADm +4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 H m -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 
NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H . 



4.2.1.17 

1-1.1-35 

4.2.1.17 

1.1.1.35 

4,2.1.17 

1.1.1-35 
4J2.1.17 

.4.1.1,9 

?.6.1.19 

B.4.1.3 

6.4.1.3 

54.99-2 

1-2.1.27 

6.2.1.- 

.1.1.1.35 

1-1.1.35 
1-3.99.2 

1.2.1.24 
4.1.1.15 
4.1.1.15 

4.1.1.15 
4.1.3.5 
4.1.3.5 
4.1.3.4 

.1.1.1,30- 
2.3.1 ■- 



1,6.5.3 

1 , 6-5.3 

1 r 6.5.3 

1-6.5.3 

1-6.5.3 

1.6.5.3 

1-6.5.3 

16.5.3 

1-6.5.3 

1 .6.99.3 

1-6.5.3 

j .6.99,3 

1-6.5.3 



1-6.5.3 
1.6.99.3 
1 -6.5.3 
1.6.99.3 
1.6.5.3 



1.6.5.3 
J, 6 .99.3 

1-6.5.3 

_1 ,6.5.3 
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NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1,6.99,3 


4706 NDUFAB1, SDAP 


NADHm + Qm +4 Hrn -> QH2m + NADm +4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4707 NDUFB1, MNLL, CI-SGDH 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1 .6.99.3 


470B NDUFB2, AGGG 


NADHm + Qm +4 Hm-> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm •> QH2m + NADm +4 H 


1.6.99.3 


4709 NDUFB3. B12 


NADHm + Qm + 4 Hm ■> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


JUL29*3 


4Zlfi NDUFB4, B15 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm +4 Hm -> QH2m + NADm +4 H 


1.6.99.3 


4ZHNDUFB5,SGDH 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4Hm-> QH2m + NADm + 4 H 


1.6.99.3 


4712 NDUFB6, B17 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4Z13 NDUFB7, B1B 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4714 NDUFB8. ASHI 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4715 NDUFB9, UQOR22, B22 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm •> QH2m + NADm + 4 H 


1.6.99.3 


4716 NDUFB10, PDSW 


NADHm + Qm + 4 Hm ■> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4Hm-> QH2m + NADm + 4 H 


1-6.99.3 


4ZLZNDUFC1.KFYI 


NADHm + Qm + 4 Hm •> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm +4 Hm -> QH2m + NADm +4 H 


1.6.99.3 


471 B NDUFC2, B14.5b, B14.5B 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


NDUFS4, AQDQ 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4725 NDUFS5 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1-6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4726 NDUFS6 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4731 NDUFV3 


NADHm + Qm + 4 Hm QH2m + NADm + 4 H 


1.6.5-3 


4727 NDUFS7, PSST 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4Hm-> QH2m + NADm + 4 H 


1-6.99.3 


4722 NDUFS3 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 




4720 NDUFS2 


NADHm + Qm + 4 Hm -> QH2m + NADm +4 H 


1.6.5.3 


4Z29 NDUFV2 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 


4223NDUFV1.UQOR1 


NADHm + Qm +4 Hm-> QH2m + NADm +4 H 


1.6.5.3 




NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.993 


4Z13NDUFS1.PRO1304 


NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 


1.6.99.3 




NADHm + Qm + 4 Hm-> QH2m + NADm + 4 H 


1.6.5.3 


4729 NDUFS8 


NADHm +Qm +4 Hm ~> QH2m + NADm +4 H 


1.6.5.3 




NADHm + Qm + 4 Hm-> QH2m + NADm + 4 H 


1-6 99.3 


63H1SDHC 


SUCCm + FADm <-> FUMm + FADH2m 


1.3.5.1 




FADH2m + Qm <-> FADm + QH2m 




£322 SDHD, CBT1, PGU PGL1 


SUCCm + FADm <-> FUMm + FADH2m 


1.3-5.1, 




« FADH2m + Qm <-> FADm + QH2m 




£SSa SDHA, SDH2, SDHF, FP 


SUCCm + FADm <-> FUMm + FADH2m 


1.3,5.1 




FADH2m + Qm <-> FADm + QH2m 




fiSSQSDHB, SDH1.IP. SDH 


SUCCm + FADm <-> FUMm + FADH2m 


1.3.5.1 




FADH2m + Qm c-> FADm + QH2m 




2285 UQCRFS1 , RIS1 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H2Cm + 4 H 


1.10.2.2 


4512 MTCYB 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1.10.2.2 


153ZCYC1 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1-10.2.2 


7384 UQCRC1, D3S3191 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1.10.2.2 


73fi5UQCRC2 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1-10.2J> 


Z3SS UQCRH 


Q2m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H2Dm + 4 H 


1.10,2^. 


7381 UQCRB, QPC, UQBP, QP-C 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1.10.2.2 


2ZQS3 QP-C 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 




J1G2Z5UQCR 


02m + 4 FEROm + 4 Hm -> 4 FERIm + 2 H20m + 4 H 


1.10.2.2 


JU83COX5BL4 


QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 


1-93.1 


45HMTC03 


QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 


1A3J 


4S12MTC01 


QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 


.1,9.3.1. 



WO 03/082214 



PCT/US03/09751 



' 74 

QH2m + 2FERIm+4Hm->Qm + 2FEROm + 4H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 

QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm +4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 
ADPm + Pirn + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pirn + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
aBAi^i a.rou ADPm + Pim + 3H ■> ATPm + 3 Hm + H20m 

43S ATP5A1, ATP5A, ATPM. OMR, HATP1 ADPm + Fim + 3 H -> ATPm + 3 Hm + H20m 
„q ATP50 ATPO OSCP ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 

^ ATP5 °- AT1 u ' u ^ ADPm + Pim+3H -> ATPm + 3 Hm + H20m 

ADPm + Plm + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm +■ 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm+Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
S&pjpsr ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 

TCIRG1, TIRC7, OC-116, 0O116kDa. + pim + 3 H _> ATPm + 3 Hm + H20m 

^OC-116KDA,ATP6N1C _ TO _, u _ w ~ m 



4513 MTC02 
1322COX5B 
1327 CQX4 
1337 COX6A1.COX6A 
1523COX6A2 
1340 C0X6B 

1345 C0X6C 

9377 COX5A, COX, VA, COX-VA 

1346 COX7A1, COX7AM, COX7A 

1347 COX7A2, COX Vlla-L 

1348 COX7A3 

1349 COX7B 

9167 COX7A2L, COX7RP, EB1 

1350 COX7C 
J351COX8, COX VIII 

4508 MTATP6 

4509 MTATP8 
499 ATP5A2 
5QZATP5BL1,ATPSBL1 
5Q5 ATP5BL2, ATPSBL2 
SlfiATPai 

537 ATP6S1 , ORF, VATPS1 , XAP-3 
514 ATP5E 
513ATP5D ' 
5Qg ATP5B, ATPSB 
* 5Q3 ATP5C1 , ATP5C 



516 ATP5G1 , ATP5G 
517ATP5G2 
51SATP5G3 
515ATP5F1 

521 ATP51 

522 ATP5J, ATP5A, ATPM, ATP5 
3551 ATP5J2, ATP5JL, F1FO-ATPASE 

10476 ATP5JD 
1Q£32 ATP5JG 
S29fiATP6S14 
52SATP6D 

523 ATP5A1 , VPP2 
524ATP6A2, VPP2 
525 ATP6B1, VPP3, VATB 
526ATP6B2, VPP3 
523ATP6E 
527 ATP6C, ATPL 
533 ATP6F 



■ 0C-116KDA, ATP6N1C 
23545 TJ6 
5Q51Z ATP6N1B 

535ATP6N1 
513B2VA7D 
5992 ATP6H 
9550 ATP6J 
515Q5LOC51606 

495 ATP4A, ATP6A 

496 ATP4B. ATP6B 
47BATP1A1 
477ATP1A2 
478ATP1A3 

479 ATP1AL1 
23439 ATP 1B4 

431 ATP1B1, ATP1B 

482ATP1B2,AMOG 

483ATP1B3 
2ZQ32 ATP2C1 , ATP2C1 A, PMR1 



ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm+.Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H •> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3H -> ATPm + 3 Hm + H20m 
ADPm + Pim + 3 H -> ATPm + 3 Hm + H20m 
ATP + H + Kxt + H20 <-> ADP + PI + Hext + K 
ATP + H + Kxt + H20 <-> ADP + PI + Hext + K 
ATP + 3MA + 2Kxt + H20<->ADP + 3NAxt + 2K + PI 
ATP+3NA + 2Kxt + H20<->ADP + 3NAxl + 2K + Pl 
ATP + 3NA + 2Kxt+H20<->ADP + 3NAxt + 2K + PI 
ATP + 3NA + 2Kxt+H20<->ADP + 3NAxt + 2K + PI 
ATP + 3NA + 2Kxt + H20 <-> ADP + 3 NAxt + 2 K + PI 
ATP + 3 NA + 2 Kxt + H20 <-> ADP + 3 NAxt + 2 K + PI 
ATP + 3NA + 2Kxt + H20<->ADP + 3NAxt + 2K + Pl 
ATP + 3NA + 2Kxt + H20-=->ADP + 3NAxt + 2K + Pl 
ATP +2CA + H20 <-> ADP + PI +2 CAxt 



1.9-3.1 
1-9.3.1 
1.9.3.1 

1.9.3.1 

1.9-3.1 
1-9-3.1 
1.9.3.1 
1.9.3.1 
J .9.3.1 
1-9.3.1 
1.9.3.1 

1.9,3.1 

1.9.3.1 

3.6.1 .34 

3-61.34 

3,6.1,34 

3.6.1.34 

3ALM 

3.6.1.34 

3.6.1.34 

3-6.1.34 

3-6.1.34 

3.6.1.34 

3.6.1.34 

36.1.34 

3.6.1.34 

3.6.1 .34 

?.6.1.34 

3.6.1.34 

36.1.34 

3-6.1.34 

36.1.34 

3.6.1.34 

3-6.1.34 

3.6.1.34. 

3,6.1.34 

3,6.1.34 

3.6,1.34 

3-6.1.34 
3 6-1-34 
36.1.34 
3,6.1.34 
3-6.1.34 
3.6.1.34 

3 6,1.34 

3.6.1.34 

3.6.1.34 
3.6.1.34 
3.6.1.34 
3, 61-34 
3,6,1.34. 

36 1.34 
3 6 1.36 
3 6 1.36 
36 1-37 
3 6.1.37 
3 6 1.37 
3 6.1.37 
3 6.1.37 
3.6.1 r37 
3 6 1.37 

3^1,37 

36.1.38 
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467 ATP2A1, SERCA1, ATP2A 
488 ATP2A2, ATP2B, SERCA2, DAR, DD 
439ATP2A3. SERCA3 
43QATP2B1.PMCA1 

491 ATP2B2. PMCA2 

492 ATP2B3, PMCA3 

493 ATP2B4, ATP2B2, PMCA4 
53B ATP7A, MK, MNK, OHS 
540 ATP7B, WND 

5464 PP, SID6-8061 
Z2 Photosynthesis PATH:hsa00195 

2.3 Carbon fixation PATH:hsaO0710 

2SQ5GOT1 
2506 GOT2 
2875 GPT 

2.4 Reductive carboxylase cyde (C02 fixation) PATH:hsa00720 

2.5 Methane metabolism PATHiisaOOSBO 

54Z CAT 
4G25LPO, SPO 
4353 MPO 

S2M EPX, EPX-PEN, EPO, EPP 
25B3K1AA0106, AOP2 
6470 SHMT1. CSHMT 
6472 SHMT2, GLYA, SHMT 
510D4 LOC51004 



ATP + 2 CA + H20 <-> ADP + PI + 2 CAxt 
ATP + 2 CA + H20 <-> ADP + PI + 2 CAxt 
ATP + 2 CA + H20 <-> ADP + Pi + 2 CAxt 
ATP + 2 CA+ H20 <-> ADP + PI + 2 CAxt 
ATP + 2 CA + H20 <-> ADP + PI + 2 CAxt 
ATP + 2 CA + H20 <-> ADP + PI + 2 CAxt 
ATP + 2 CA + H20 <-> ADP + PI + 2 CAxt 
ATP + H20 + Cu2 -> AD P + PI + Cu2xt 
ATP + H20 + Cu2 -> ADP + PI + Cu2xt 
PPI •> 2 PI 



OAm + GLUm <-> ASPm + AKGm 
OA + GLU <-> ASP + AKG 
PYR + GLU <-> AKG + ALA 



2H2Q2->02 



9420 CYP7B1 

2.6 Nitrogen metabolism PATH:hsa00910 
1123B CA5B 
23632 CA14 

759 CA1 

Z6QCA2 

761 CA3, CAIII 

762 CA4, CAIV 

253 CA5A, CA5, CAV, CAVA 
265 CAB 

766 CA7 

767 CAS, CALS. CARP 
26BCA9. MN 

770 CA11. CARP2 

771 CA12 
1323 CPS1 

225 MAT 

3034 HAL, HSTD, HIS 
2245 GLUD1, GLUD 

8307 GLUD2 

2252 GLUU GLNS 
22542 K1AA0838 
22155 GA 

2244 GLS 
440 ASNS 

1491 CTH 

2.7 Sulfur metabolism PATH:hsaD0920 

9060 PAPSS2, ATPSK2, SK2 

9061 PAPSS1, ATPSK1, SK1 

BPNT1 
6799 SULT1A2 
5532SULT1A1.STP1 
6818 SULT1A3, STM 
£522 SULT2A1 , STD 



THF + SER <-> GLY + METTHF 
THFm + SERm <-> GLYm + METTHFm 
20PMPm + 02m -> 20PMBm 
20PMMBm + 02m -> 20MHMBm 
20PMPm + 02m -> 20PMBm 
20PMMBm + 02m -> 20MHMBm 



GLUm + C02m + 2 ATPm -> 2 ADPm + 2 Plm + CAPm 
GLYm + THFm + NADm <-> METTHFm + NADHm + C02m + 
NH3m 

HIS -> NH3 + URO 

AKGm + NADHm + NH3m <-> NADm + H20m + GLUm 
AKGm + NADPHm + NH3m <-> NADPm + H20m + GLUm 
AKGm + NADHm + NH3m <-> NADm + H20m + GLUm 
AKGm + NADPHm + NH3m <-> NADPm + H20m + GLUm 
GLUm + NH3m + ATPm -> GLNm + ADPm + Pirn 
GLN -> GLU + NH3 
GLN -> GLU + NH3 
GLNm -> GLUm + NH3m 

ASPm + ATPm + GLNm -> GLUm + ASNm + AMPm + PPlm 
LLCT + H20 -> CYS + HSER 
0BUT + NH3<->HSER 

APS + ATP -> ADP + PAPS 
SLF+ATP->PPI + APS 
APS + ATP -> ADP + PAPS 
SLF + ATP -> PPI + APS 
PAP AMP + PI 



3.6.1.38 

3.6.1.38 

3,6,1.38 

3,6.1.38. 

3.6.1.38 

3.6.1.38 

3.6.1,38 

3.6.3.4 

3.6.3.4 

3.6.1.1 



2.6.1.1 
2.6.1.1 
2.6.1.2 



1-11-1.B 
1-11.1.7 
1-11.1.7 
1.11.1-7 
11117 



2.1.2.1 

2.1.2.1 

.1.14.13.- 
.1 ,.14.13.- 



4.2.1.1 

4.2.1.1 
4.2.1.1 
4.2.1.1 
4.21.1 
4.2.1.1 
4.2.1.1 
4.2.1.1 
4.2.1.1 
4,2.1.1 
4.2.1.1 
4.2.1.1 

4.2.1.1 
.6..3.4.16 

4.3.1.3 
1.4.1.3 



1.4.1.3 

5^L2_ 
3^X2 
3*5X2 

35 1.2 
6^3.5.4 
4 4.1.1 
4 4 1.1 

2 7.1.25 

2 n 7.7.4 

? , 7.1.25 

2, 77-4 

31.3.7 

?fi.2.1 
2.6.2.1 
2JlZ2 
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6783 STE, EST 
6B21SUOX 
3. Upld Metabolism 

3.1 Fatty add biosynthesis (path 1) PATH:hsa00061 
?194 FASN 

32 Fatly acid biosynthesis (path 2) PATH*Jisa00062 

104 4 9 ACAA2, DSAEC MAACOAm -> ACCOAm + PROPCOAm 

3Q ACAAll ACAA MAACOA -> ACCOA + PROPCOA 

2022 HADHB MAACOA -> ACCOA + PROPCOA 

3.3 Fatty add metabolism PATH:hsa00071 
5lACOX1,ACOX 
33 ACADL, LCAD ' 
2539 GCDH 

21Z9FACL1 LACS ATP + LCCA + COA <■> AMP + PPl + ACOA 

2m FACL2! FACL1, LACS2 ATP + LCCA + COA <-> AMP + PPl + ACOA 

21S2 FACU, ACS4 ATP + LCCA + COA <-> AMP + PPl + ACOA 

1374 CPT1/C CPT1, CPT1-L 

1375 CPT1B, CPT1-M 

1376 CPT2, CPT1 , CPTASE 
JS32DC1 

11283 CYP4F8 
4543CYP1A1.CYP1 
1544 CYP1A2 
1545CYP1B1;gLC3A 

1548 CYP2A6, CYP2A3 

1549 CYP2A7 
1551 CYP3A7 
_1553CYP2A13 

1554 CYP2B 

1555 CYP2B6 

1557 CYP2C19, CYP2C, P450IIC19 
1558CYP2CB 

1559 CYP2C9, P450UC9, CYP2C10 
1562 CYP2C18, P450UC17, CYP2C17 
1565 CYP2D6 

1571 CYP2E, CYP2E1, P450C2E 

15Z2CYP2F1.CYP2F 

1573 CYP2J2 

1575 CYP3A3 

1576 CYP3A4 

1577 CYP3A5, PCN3 
1580 CYP4B1 
iSBS CYP19. ARO 
1525CYP51 

iS^AHHR, AHH 

3.4 Synthesis and degradation of ketone bodies PATH:hsa00072 

3.5 Sterol biosynthesis PATH:hsa00100 



PCT/US03/09751 



2.R.2.4 
1.8.3.1 



2.3.1 .85_ 

2,3,1,16 
2.3.1.16 
2.3.1.16 

1,3,99.13 

1-3.99.7 

6.2.1.3 

6.2.1.3 

6.2.1.3 

2,3,1.21 

2,3.1.21 

2.3.1.21 

5.3.3.8. 

1-14.14.1 

1.14.14.1 

1,14.14,1 

1.14.14.1 

1,14,14.1 

114.14.1 

1.14.14,1 

1.14.14.1 

1.14.14.1 

1.14.14,1 

1.14.14.1 

1.14.14.1 

1.14.14.1 

1.14.14.1 

1.14.14.1 

1-14.14.1 

1-14.14.1 

1.14.14.1 

1-14.14.1 

114.14.1 

114.14.1 

1.14.14.1 

1.14.14.3 

1 14.14.1 

1.14.14.1 



3156 HMGCR 
4598 MVK, MVLK 



j£654 PMVK, PMKASE, PMK, HUMPMKl 
4527 MVD, MPD 
3422 IDH 
2224 FDPS 



2453 GGPS1 , GGPPS 



Z22Z FDFT1, DGPT 
5Z13SQLE 
4047 LSS, OSC 

<J22l DIA4, NMOR1, NQ01, NMORl 
NMOR2, NQ02 
3Z ACADVL, VLCAD, LCACD 
3.6 Bile acid biosynthesis PA"m:hsa00120 



MVL + COA + 2 NADP <-> H3MCOA + 2 NADPH 
ATP + MVL -> AD P + PMVL 
CTP + MVL -> CDP + PMVL 
GTP + MVL -> GDP + PMVL 
UTP + MVL -> UDP + PMVL 
ATP + PMVL -> ADP + PPMVL 
ATP + PPMVL -> ADP + PI + IPPP + C02 
IPPP <~> DMPP 
GPP + IPPP ->FPP + PPl 
DMPP + IPPP ■> GPP + PPl 
DMPP + IPPP -> GPP + PPl 
GPP + IPPP -> FPP + PPl 

2 FPP + NADPH -> NADP + SQL 
SQL + 02 + NADP -> S23E + NADPH 
S23E -> LNST 



1.1,1.34 
2 7 1.36 



Z1A2 

4.1.1.33 

5.3.3.2 

2J5JL1Q 

2.5.1.1 

2.5.1.1 

7.5.1 -10 

2.5,1.29 

1-14.99.7 

5.4.99.7 

1-6-99.2 

1.6,99,2. 

1.3.99.- 
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1Q55CEL, BSSL.BAL 

33S& LIPA, LAL 

. S0AT1, ACAT f STAT. SOAT, ACAT1, 
l ACACT 
l5aiCYP7A1,CYP7 
£215SRD5A1 
m£SRD5A2 

671 B AKR1D1, SRD5B1, 3o5bred 
570 BAAT, BAT 
3.7 C21-Sterold hormone metabolism PATH:hsa00140 
15B3 -CVP11A. P450SCC 
32B3 HSD3B1, HSD3B, HSDB3 



3284 HSD3B2 



IMZYMST -> IIMZYMST + C02 
IMZYMST -> IIZYMST + C02 

IMZYMST -> IIMZYMST + C02 
IMZYMST -> IIZYMST + C02 



CYP21A2, CYP21, P450C21B, 
-™ 1 CA21H, CYP21B, P450c21B 
J58£CYP17,P450C17 
1584 CYP11B1, P450C11, CYP11B 
l5a5CYP11B2,CYP11B 

522D HSD1 1B1, HSD11, HSD 1 1 U HSD1 1 B 

22aiHSD11B2,HSD11K 
3.8 Androgen and estrogen metabofism PATH:hsa00150 
, HSD17B1, EDH17B2, EDHB17, 
■ HSD17 
3223 HSD17B3, EDH17B3 
^34HSD17B2, EDH17B2 
2225HSD17B4 

. HSD17BP1, EDH17B1, EDHB17, 
1 HSD17 
51478 HSD17B7, PRAP 
412 STS, ARSC. ARSC1 . SSDD 
414ARSD 

415 ARSE, CDPX1, CDPXR, CDPX 
111B5 INMT 
2414Q JM23 

29104 N6AMT1, PRED28 
2225QFJH1 

32Z5 HRMT1L2, HCP1, PRMT1 
51£2B LOC51628 
54743 HASJ4442 
2Z222 HSA9761 
4. Nucleotide Metabofism 
4.1 Purine metabofism PATH:hsa00230 
11164 NUDT5, HYSAH1. YSA1H 

5471 PPAT, GPAT 

2618 GART, PGFT, PRGS 



5198 PFAS, FGARAT, K1AA0361, PURL 
1QSQ5ADE2H1 

5Q59 PAICS, AIRC, PAIS 
15fi ADSL 
4Z1ATIC.PURH 

3251 HPRT1, HPRT, HGPRT 

3814 IMPDH1 
3615 IMPDH2 
B833GMPS 
14923 
29aZGUK1 



3.1.1.3 

3,1.1.13 

3,1.1,13 

2JU26 

1,14,13.17 

1,3,99.5 

1,3,99,5 

1.3.99.6 

2.3.1.65 

114J&6 

5.3.3.1 

1.1.1.145 

5.3.3.1 

1,1.1.145 

1 .14,99.10 

1.14.99.9 
1.14.15.4 
1.14.15.4 

1.1.1.146' 

1,1.1,146 





1.1.1.62 




1,1,1.62 




.1.1,1.62 




1.1.1.62 




1.1.1.62 




11.1,62. 




2152. 




3.1 6.1 




3.1.6.1 




2.1.1 .- 




2.1 .1.- 




2.1.1 .- 




2.1.1.- 




211= 




211= 




2.1.1.- 




211= 




3.6.1.13 


PRPP + GLN-> PPI + GLU + PRAM 


2.4.2.14 


PRAM + ATP + GLY<-> ADP + PI + GAR 


6.3.4.13 


FGAM + ATP -> ADP + PI + AIR 




GAR + FTHF -> THF + FGAR 


2122 


FGAR + ATP + GLN -> GLU + ADP + PI + FGAM 




CAIR + ATP + ASP <-> ADP + PI + SAICAR 


&32£_ 


CAIR <-> AIR + C02 


41121 


CAIR + ATP + ASP <-> ADP + PI + SAICAR 


6.3^.6 


ASUC <-> FUM + AMP 


1322_ 


AICAR + FTHF <-> THF + PRFICA 


2123 


PRFICA<->IMP 


3.5.4.10 


HYXAN + PRPP -> PPI + IMP 




GN + PRPP -> PPI + GMP 




IMP + NAD -> NADH + XMP 


1. J, 1205 


IMP + NAD -> NADH + XMP 


H12QS 




£2£2_ 


GMP + ATP <-> GDP + ADP 


2.74.8 
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GUK2 



1Q621RPC39 
U3622RPC32 
10623 RPC62 
11126 RPC155 
25BS5DKFZP586M0122 
3QS34ZNRD1 
£!Qfi2 LOC51082 
51728 LOC51728 

5430 POLR2A, RPOL2, POLR2, POLRA 

5431 POLR2B, POURB 

5432POLR2C 

5433 POLR2D, HSRBP4, HSRPB4 

5434 POLR2E, RPB5, XAP4 

5435 POLR2F, RPB6, HRBP14.4 

5436 POLR2G, RPB7 

5437 POLR2H, RPB8, RPB17 
542SPOLR2I 

5439 POLR2J 

5440 POLR2K, RPB7.0 

5441 POLR2L, RPB7.6, RPB10 

5442 POLRMT, APOLMT 
54479 FU10816, Rpo1-2 
55703 FIJI 0388 

6S1BN51T 
3533 RPA40, RPA39 
1C221POLQ 

11232 POLG2, MTPOLB. HP55, POLB 
2364SPOU2 

5422 POLA 

5423 POLB 
5424POLD1,POLD 
5425POLD2 
5426 POLE 
542ZPOLE2 
542a POLG 

53BQ REV3L, POLZ, REV3 
7498 XDH 



DGMP + ATP <-> DGDP + ADP 
GMP + DATP <-> GDP + DADP 
GMP + ATP <-> GDP + ADP 
DGMP + ATP <•> DGDP + ADP 
GMP + DATP <-> GDP + DADP 



GDA, KIAA1258, CYPIN, 
2Z5SGMPR 
51292 LOC51292 
7_3ZZUOX 
524QRRM1 



6241 RRM2 



NEDASIN 



4860 NP, PNP 



IfiSQ ECGF1, hPD-ECGF 

353 APRT 
132 ADK 
1633 DDK 



ADP + RTHIO -> DADP + OTHIO 
GDP + RTHIO -> DGDP + OTHIO 
CDP + RTHIO •> DCDP + OTHIO 
UDP + RTHIO -> DUDP + OTHIO 
ADP + RTHIO -> DADP + OTHIO 
GDP + RTHIO -> DGDP + OTHIO 
CDP + RTHIO ■> DCDP + OTHIO 
UDP + RTHIO -> DUDP + OTHIO 
AND + Pl <->AD + R1P 
GSN + PI <~> GN + R1P 
DA + PI<->AD + R1P 
DG + PI«w>GN + R1P 
DIN + PI <->HYXAN + R1P 
INS + PI <-> HYXAN + R1P 
XTSINE + PI<->XAN + R1P 
DU + PI<->URA + DR1P 
DT+PI<->THY + DR1P 
AD + PRPP -> PPI + AMP 
ADN + ATP -> AMP + ADP 



2,7.4.8 



2XL5 
2, 77-6 
?7.7.6 
2 , 7-7.6 
2, 7.7.6 
2.7.7.6 
2,7,7.6. 
2.7.7.6 
?_.7.7.6 
?.7.7.6 
?T.7.6 

ZJZ3- 

2.7.7.6 

?.7.7.6 

2-7.7.6 
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2,4.2.7 

?.7.1.20 
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1716 DGUOK 
203 AK1 

2MAK2 

205 AK3 

26289 AK5 

4830 NME1, NM23, NM23-H1 

4831'nMEZ NM23-H2 

4832 NME3, DR-nm23, DR-NM23 

4B33NME4 

225IB NT5B, PNT5, NT5B-PENDING 



4877 MT3 
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ATP + AMP <-> 2 ADP 

GTP + AMP <-> ADP + GDP 

ITP + AMP <-> ADP + IDP 

ATP + AMP <-> 2 ADP 

GTP + AMP <-> ADP + GDP 

HP + AMP <-> ADP + IDP 

ATP + AMP <-> 2 ADP 

GTP + AMP <-> ADP + GDP 

ITP + AMP <-> ADP + IDP 

ATP + AMP<->2ADP 

GTP 4- AMP <-> ADP + GDP 

rrp + AMP <-> ADP + IDP 

UDP + ATP <-> UTP + ADP 

CDP + ATP <-> CTP + ADP 

GDP + ATP «-> GTP + ADP 

IDP + ATP <-> ITP + IDP 

DGDP + ATP <-> DGTP + ADP 

DUDP + ATP <-> DUTP + ADP 

DCDP + ATP <-> DCTP + ADP 

DTDP + ATP <-> DTTP + ADP 

DADP + ATP <-> DATP + ADP 

UDP + ATP <-> UTP + ADP 

CDP + ATP <-> CTP + ADP 

GDP + ATP <-> GTP + ADP 

IDP + ATP <-> ITP + IDP 

DGDP + ATP <-> DGTP + ADP 

DUDP + ATP <~> DUTP + ADP 

DCDP + ATP <-> DCTP + ADP 

DTDP + ATP <-> DTTP + ADP 

DADP + ATP <-> DATP + ADP 

UDP + ATP <-> UTP + ADP 

CDP + ATP <-> CTP + AD P 

GDP + ATP <-> GTP + ADP 

IDP + ATP<->ITP + IDP 

DGDP + ATP <-> DGTP + ADP 

DUDP + ATP <-> DUTP + ADP 

DCDP + ATP <-> DCTP + ADP 

DTDP + ATP <-> DTTP + AD P 

DADP + ATP <-> DATP + ADP 

UDPm + ATPm <-> UTPm + ADPm 

CDPm + ATPm <-> CTPm + ADPm 

GDPm + ATPm <-> GTPm + ADPm 

IDPm + ATPm <-> ITPm + IDPm 

DGDPm + ATPm <-> DGTPm + ADPm 

DUDPm + ATPm <-> DUTPm + ADPm 

DCDPm + ATPm <-> DCTPm + ADPm 

DTDPm + ATPm «-> DTTPm + ADPm 

DADPm + ATPm <-> DATPm + ADPm 

AMP + H20 -> PI + ADN 

GMP -> p| + GSN 

CMP -> CYTD + PI 

UMP->PI + URI 

IMP -> PI + INS 

DUMP->DU + PI 

DTMP -> DT + PI 

DAMP->DA+PI 

DGMP->DG + P1 

DCMP -> DC + Pi 

XMP->PI+XTSINE 

AMP->PI+ADN 

GMP -> PI + GSN 

CMP -> CYTD + PI 

UMP -> PI + URI 

IMP -> PJ + INS 

DUMP -> DU + PI 

DTMP->DT + PI 
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27.4.6 

27.4.6 

274.6 

2.7.4,6 
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4907 NT5, CD73 



7370 UMPH2 



1P846 PDE10A 



27115 PDE7B 



5136 PDE1A 



S137 PDE1C, HCAM3 



5138 PDE2A 



5139 PDE3A, CGI-PDE 



S140 PDE3B 
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DAMP->DA+PI 
DGMP->DG + PI 
DCMP->DC+PI 
XMP -> PI + XTSINE 
AMP -> PI + ADN 
GMP->P! + GSN 
CMP->CYTD + PI 
UMP ■> PI + URI 
IMP->P1 + INS 
DUMP->DU + PI 
DTMP->DT + PI 
DAMP->DA + P! 
DGMP -> DG + PI 
DCMP->DC + Pl 
XMP -> PI + XTSINE 
AMP -> PI + ADN 
GMP->PI + GSN 
CMP •> CYTD + PI 
UMP->PI + URI 
IMP ->P1 + INS 
DUMP->DU + Pt 
DTMP -> DT + PI 
DAMP->DA + Pl 
DGMP -> DG + PI 
DCMP->DC + P! 
XMP -> PI + XTSINE 
cAMP -> AMP 
cAMP->AMP 
cdAMP -> dAMP 
dMP -> IMP 
cGMP->GMP 
cCMP -> CMP 
cAMP •> AMP 
cAMP->AMP 
cdAMP -> dAMP 
c!MP->IMP 
cGMP->GMP 
cCMP -> CMP 
cAMP -> AMP 
cAMP -> AMP 
cdAMP -> dAMP 
clMP -> IMP 
cGMP->GMP 
cCMP->CMP 
cAMP-> AMP 
cAMP->AMP 
cdAMP -> dAMP 
clMP->IMP 
cGMP -> GMP 
cCMP->CMP 
cAMP->AMP 
cAMP -> AMP 
cdAMP -> dAMP 
dMP -> IMP 
cGMP -> GMP 
cCMP -> CMP 
cAMP -> AMP 
cAMP -> AMP 
cdAMP -> dAMP 
dMP -> IMP 
cGMP -> GMP 
cCMP -> CMP 
cAMP -> AMP 
cAMP->AMP 
cdAMP -> dAMP 
dMP -> IMP 
cGMP->GMP 



3.1.3.5 



3.1.3.5 



3-1-4.17 



3-1-4.17 



3.1.4.17 



3.1.4.17 



^ 14.17 



* 1 4.17 



31.4.17 
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5141 PDE4A, DPDE2 

5142 PDE4B, DPDE4, PDEIVB 

5143 PDE4C, DPDE1 

5144 PDE4D, DPDE3 

5145 PDE6A, PDEA, CGPR-A 

5146 PDE6C, PDEA2 

5147 PDE6D 

5148 PDE6G, PDEG 

5149 PDE6H 
51S PDE9A 



5152PDES1B 



5158 PDE6B, CSNB3, PDEB 
B554PDE5A 
100 ADA 

270 AMPD1.MADA 

271 AMPD2 
2Z2AMPD3 
953ENTPD1,CD39 

3704 ITPA 

107 ADCY1 

108 ADCY2, HBAC2 

109 ADCY3, AC3, KIAA0511 

110 ADCY4 

111 ADCY5 

112 ADCY6 

113 ADCY7, KIAA0037 

114 ADCY8, ADCY3, HBAC1 
H5ADCY9 

2977 GUCY1A2, GUC1A2, GC-SA2 

, GUCY1A3, GUC1A3, GUCSA3, GO 



cCMP -> CMP 

cAMP->AMP 

cAMP -> AMP 

CAMP -> AMP 

cAMP->AMP 

cGMP -> GMP 

cGMP -> GMP 

cGMP -> GMP 

cGMP -> GMP 

cGMP -> GMP 

cAMP -> AMP 

cAMP -> AMP 

cdAMP -> dAMP 

c!MP -> IMP 

cGMP ■> GMP 

cCMP -> CMP 

cAMP->AMP 
cAMP -> AMP 
cdAMP -> dAMP 
cIMP ■> IMP 
cGMP -> GMP 
cCMP -> CMP 
cGMP->GMP 
cGMP->GMP 
ADN->INS + NH3 
DA -> DIN + NH3 
AMP->IMP + NH3 
AMP -> IMP + NH3 
AMP -> IMP + NH3 



2982 
2983 



SA3 



SB3 

2SS4 GUCY2C, GUC2C, STAR 

oooe GUCY2F, GUC2F, GOJ=, GUC2DL, 

^RETGO* 

GUCY2D, CORD8. GUC2D. LCA1, 
afflB GUC1A4, LCA, retGC 
4881 NPR1, ANPRA, GUC2A, NPRA 
„ aoo NPR2, ANPRB, GUC2B, NPRB, 
^NPRBi 
1SADSS 
31B NUDT2, APAH1 

f ENPP1, M6S1, NPPS, PCA1, PC-1, 
- PDNP1 

5168 ENPP2, ATX, PD-IALPHA, PDNP2 
5159 ENPP3, PD-IBETA, PDNP3 



22Z2FHIT 

4.2 Pyrimidine metabolism PATH:hsa00240 
790 CAD 



1723 DHODH 
2222 UMPS, OPRT 



ATP-> 
ATP-> 
ATP-> 
ATP-> 
ATP -> 
ATP-> 
ATP-> 
ATP-> 
ATP~> 



cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 
cAMP + PPI 



GUCY1 B3, GUC1B3, GUCSB3. GC- 



IMP + GTP + ASP -> GDP + PI + ASUC 



GLN + 2 ATP + CQ2 -> GLU + CAP + 2 ADP + PI 

CAP + ASP -> CAASP + PI 

CAASP <-> DOROA 

DOROA + 02 <-> H202 + OROA 

OMP -> CQ2 + UMP 



3,1.4.17 
3.1.4,17 
3.1.4.17 
3,1.4.17 
3,1.4.17 
3.1.4.17 
3.1.4.17 

3-1-4.17 

3.1.4.17 
3.1,4.17 



3.1.4.17 



3.1.4.17 

3.1.4.17 
3.5.4.4 



3.5.4.6 
3.5.4.6 
3.5.4.6 
3.6.1.5 

3.6.1.19 

4.6.1.1 

4.6.1.1 

4.6.1.1 

4.6,1.1 ' 

4.6.1.1 

4.6.1.1 

4.6.1.1 

4-6.1.1 

4.6.1.1 

4.6.1.2 

4,6.1.2 
4,6.1.2 

4 6.1.2 
4.6.1.2 

4 6.1.2 

4.6.1.2 

4.6.1.2 

6.3.4.4 
3.6.1.17 

3£JL9 

3£J-9 
• 3.1.4.1 

3.6.1.29 

ZJULZ. 

1,3.3.1 
4,1.1.23 
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51Z2ZLOC51727 



5QSQSAKL3L 
1503 CTPS 

jr3ZlUMPK,TSA903 



7378 UP 

1806 DPYD, DPD 

1807 DPYS. DHPase, DHPASE, DHP 
51733 LOC51733 

7295 TXNRD1, TXNR 
1854 PUT 

7298 TYMS. TMS.TS 
S2B CDA, CDD 

J£25DCTD 

7083 TK1 

7084 TK2 



1841 DTYMK, TYMK, CDCB 
4.3 Nucleofide sugars metabolism PATH:hsa00520 

23483 TDPGD 

14B6 CTBS. CTB 
5. Amino Acid Metabolism 
5.1 Glutamate metabolism PATH:hsa00251 

fig59 ALDH4, P5CDH 

2D5a EPRS, QARS, QPRS 
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OROA + PRPP <-> PPI + OMP 
ATP + UMP<->ADP + UDP 
CMP + ATP<->ADP + CDP 
DCMP + ATP <-> ADP + DCDP 

UTP + GLN + ATP -> GLU + CTP + ADP + Pi 
ATP + UTP + NH3 -> ADP + PI + CTP 
URI + ATP->ADP + UMP 
URI + GTP->UMP + GDP 
CYTD + GTP -> GDP + CMP 
URI + Pl<->URA + R1P 



OTHIO + NADPH -> NADP + RTHIO 
DUTP -> PPI + DUMP 
DUMP + METTHF -> DHF + DTMP 
CYTD->URI + NH3 
DC->NH3 + DU 
DCMP <-> DUMP + NH3 
DU + ATP -> DUMP + ADP 
DT + ATP -> ADP + DTMP 
DUm + ATPm -> DUMPm + ADPm 
DTm + ATPm -> ADPm + DTMPm 
DTMP + ATP <-> ADP + DTDP 



P5C + NAD + H20 -> NADH + GLU 
GLU + ATP -> GTRNA + AMP + PPI 



26Z3 GFPT1, GFA, GFAT, GFPT 
9945 GFPT2, GFAT2 
5359 QARS 

2Z^ GLCLC, GCS, GLCL 
222Q GLCLR 
2S32 GSS, GSHS 
2336 GSR 

MfifiPET112UPET112 
5.2 Alanine and aspartate metabolism PATH:hsa00252 
4677 NARS, ASNRS ATP 
435 ASL • 
189 AGXT, SPAT 



F6P + GLN->GLU + GA6P 
F6P + GLN->GLU + GA6P 



CYS + GLU + ATP -> GC + PI + ADP 
CYS + GLU+ATP->GC + Pl + ADP 
GLY + GC + ATP^>RGT + Pl + ADP 
NADPH + OGT -> NADP + RGT 



ATP + ASP + TRNA -> AMP + PPI + ASPTRNA 
ARGSUCC -> FUM + ARG 
SERm + PYRm <-> ALAm + 3HPm 
ALA + GLX <-> PYR + GLY 



15 AARS 
1615 DARS 

445 ASS, CTLN1, ASS1 
443 ASPA, ASP, ACY2 
1384 CRAT, CAT1 



CITR + ASP + ATP <-> AMP + PPI + ARGSUCC 



ACCOA + CAR -> COA + ACAR 



3523 DDO , nMtx 
5.3 Glycine, serine and threonine metabolism PATH:hsa00260 
. + n 



5223 PSPH, PSP 
29353 PSA 

2fi22I PHGDH, SERA, PGDH, PGD, PGAD 
23454 GCAT, KBL 

211 ALAS1, ALAS 

212 ALAS2, ANH1, ASB 

4128 MAOA 

4129 MAOB 

25 ABP1,AOC1,DAO 
314 AOC2, DA02, RAO 
fi£32 AOC3, VAP-1. VAP1, HPAO 
2731 GLDC 



3PSER + H20->PI + SER 
PHP + GLU <-> AKG + 3PSER 
OHB + GLU <-> PHT + AKG 
3PG + NAD <-> NADH + PHP 

SUCCOA + GLY -> ALAV + COA + C02 
SUCCOA + GLY -> ALAV + COA + CQ2 
AMA + H20 + FAD ■> NH3 + FADH2 + MTHGXL 
AMA + H20 + FAD ■> NH3 + FADH2 + MTHGXL 



GLY + UPO<->SAP + C02 



9 4.2.10 
7 7 4.14 



2.7.4.1Q. 

6.34.2 

2.7.1.48 



2.4.2.3 
1-31.2 

3,5X6 

1.6.4.5 
3^X23 

21-1.45 
3-5.4.5 

3-5,4.12 
2.7.1,21 

9-7.1.21 

9.7.4.9 

4 2.1.46 
3.2.1. - 



1 5.1.12 
fi-1.1.17 
6.1.1.15 
96.1.16 
9 61.16 
R1.118 

6 3 9-3 
1.6.4.2 
6.3.5,- 

9 6 1.51 
96 1.44 
61.1.7 
61 1.12 
63.4.5 
3, 5-1-15 
9 3.1.7 

1 4.3.1 

3^3.3 

Z3X52L 



1 , 11.95 

2 ^1.29 

? 3-1.37 

2^.1,37 

1 4.3.4 

1 4.3.4 

1-4.3.6 

1 , 4.3.6 

1, 4.3.6 

14.4.2 
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1£10 DAO, DAMOX 
2617 GARS 
2628 GATM 
2523 GAMT 

PISD, PSSC, DKFZP566G2246. 
23761 DJ858B16 

£35 BHMT 
29258 DMGDH 
875 CBS 
£301 SARS, SERS 
1G233 SDS, SDH 
fifiSZTARS 
5.4 Methionine metaboEsm PATH:hsa00271 



PS->PE + C02 

SER + HCYS -> LLCT + H20 
SER->PYR + NH3 + H20 



4143 MAT1A, MATA1, SAMS1, MAT, SAMS MET + ATP + H20 -> PPI + PI + SAM 



MET + ATP + H20 -> PPl + PI + SAM 
SAM + DNA-> SAH + DNA5MC 
SAH + H20-> HCYS + ADN 
SAH + H20 -> HCYS + ADN 



HCYS + MTHF -> THF + MET 



4144 MAT2A, MATA2, SAMS2, MATH 
1ZBS DNMT1, MCMT, DNMT 
1D76B AHCYL1 , XPVKOMA 

191 AHCY, SAHH 
4141 MARS, METRS, MTRNS 
4548 MTR 

5.5 Cysteine metaboDsm PATH:hsa00272 

833 CARS 
Jfl35 CDOl" 
S5Q9 NDST2, HSST2, NST2 

5.6 Valine, leucine and isoleucine degradation PATH:hsa00280 

586 BCAT1, BCT1, ECA39, MECA39 AKG + ILE -> OMVAL + GLU 

AKG + VAL -> OIVAL + GLU 



CYS + 02 ■ 



►CYSS 



5SZ BCAT2, BCT2 

5014 OVD1A 
593 BCKDHA, MSUD1 



594 BCKDHB, E1B 



3712 IVD 
315 AOX1.AO 



AKG + LEU -> OICAP + GLU 
OICAPm + GLUm <-> AKGm + LEUm 
OMVALm + GLUm <-> AKGm + ILEm 

OMVALm + COAm + NADm -> MBCOAm + NADHm + C02m 
OtVALm + COAm + NADm -> IBCOAm + NADHm + C02m 
OICAPm + COAm + NADm -> IVCOAm + NADHm + C02m 
OMVALm + COAm + NADm -> MBCOAm + NADHm + C02m 
OIVALm + COAm + NADm -> IBCOAm + NADHm + C02m 
OICAPm + COAm + NADH -> IVCOAm + NADH m + C02m 
IVCOAm + FADm -> MCRCOAm + FADH2m 



1.4.3.3 
6.1-1-14 
2.1,4.1 
2.1. 1 , 2 

4.1.1,65 

2.1 .1 .5 
1-5*33,2. 

4.2.1.22 
6.1.1.11 
4.2.1.13 
6.1.1.3 

2^Lfi 

2.5.1.6 

2.1.1,37 

3.3.1.1 

3.3.1.1 

6.1.1.10 

2.1.1.13 

6.1.1.16 

1.13.11.20 

2.8.2.- 

2.6.1.42 



2.6.1.42 

1.2.4.4 
1.2.4.4 



4164 MCCC1 



4165 MCCC2 



MCRCOAm + ATPm + C02m + H20m -> MGCOAm + ADPm + 
Pirn 

MCRCOAm + ATPm + C02m + H20m -> MGCOAm + ADPm + 
Pirn 

5.7 Valine, leucine and isoleucine biosynthesis PATH:hsa00290 
23395 K1AA0028, LARS2 

3225 LARS 
33Z5 IARS. ILRS 

7406 VARS1 , VARS 

7407 VARS2, G7A 

5.8 Lysine biosynthesis PATH±isa00300 

2Z35 KARS, WAA0070 ATP + LYS + LTRNA -> AMP + PPI + LLTRNA 

5.9 Lysine degradation PATH:hsa0D310 

£424 BBOX, BBH. GAMMA-BBH. G-BBH 

5351 PLOD, LLH 

5352 PLOD2 
£935 PLOD3, LH3 

10157 LKR/SDH, AASS LYS + NADPH + AKG -> NADP + H20 + SAC 

SAC + H20 + NAD -> GLU + NADH + AASA 

5.10 Arginine and proline metabofism PATH ±tsaO0 330 

3109 OTC ORNm + CAPm -> CITRm + Pbn + Hm 

333ARG1 ARG -> ORN + UREA 

^4 AR62 ARG -> ORN + UREA 

45*2 NOS1.NOS 
4343 NOS2A, NOS2 
4545 NOS3. ECNOS 
v. 4942 OAT* ORN + AKG <-> GLUGSAL + GLU 



1.2.4.4 



1.3-99.10 
1.2.3.1 

6.4.1.4 
6.4.1.4 

6.4,1.4 
6.4.1.4 
6.1.1.5 
6.1.1.9 

6.1.1.9 

6.1.1.6 

1.14.11.1 
1.14.11.4 
1.14.11.4 
1.14.11.4 
1.5.1.9 



2X3-3 

3.5.3.1 

3.5.3.1 

1 14.1339 

1.14.1339 

114.13-39 

2,6.1.13 
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5831 PYCR1, P5C, PYCR 



P5C + NADPH -> PRO + NADP 
P5C + NADH -> PRO + NAD 
PHC + NADPH -> HPRO + NADP 
PHC + NADH -> HPRO + NAD 

ATP + ARG + ATRNA -> AMP + PPI + ALTRNA 
PCRE + ADP -> CRE + ATP 



PTRSC + SAM -> SPRMD + 5MTA 
SAM <-> DSAM + C02 
SAM <-> DSAM + C02 
SPRMD + Qm -> DAPRP + QH2m 
DSAM + SPRMD -> 5MTA + SPRM 
ORN-> PTRSC + C02 



FIGLU + THF -> NFTHF + GLU 



ACAL + NAD -> NADH + AC 
ACAL + NAD -> NADH + AC 
ACAL + NAD -> NADH + AC 
ACAL + NAD •> NADH + AC 
ATP + HtS +• HTRNA -> AMP + PPI + HHTRNA 

AKG + TYR -> HPHPYR + GLU 
HPHPYR + 02 -> HGTS + C02 
HGTS + 02 -> MACA 
MACA -> FACA 



3133 P4HA1, P4HA 
£917 RARS 
1 152 CKB, CKBB 
1156 CKBE 

1158 CKM, CKMM 

1159 CKMT1, CKMT, UMTCK 

1160 CKMT2. SMTCK 
S23SRM, SPS1, SRML1 

2S2 AMD1, ADOMETDC 

2B3 AMDP1 , AMD, AMD2 
1725 DHPS 
S6HSMS 
4953 ODC1 
£302 SAT, SSAT 

5.11 Histidine metabolism PATH:hsa0034O 
10841 FTCD 

3067 HDC 
1644 DDC, AADC • 
3176 HNMT 
215 ALDH3 

220 ALDH6 

221 ALDH7, ALDH4 

222 ALDH8 
3035 HARS 

5.12 Tyrosine metabolism PATH:hsa00350 

sssbtat 

3242 HPD, PPD 
3QS1 HGD, AKU, HGO 
2954 GSTZ1 , MAAI 

2184 FAH 
7299 TYR, OCAlA 
7054 TH.TYH 
JS21DBH 
5409 PNMT, PENT 
1312 COMT 
ZU3 TPO, TPX 

5.13 Phenylalanine metaboRsm PATH*sa00360 

501 ATQ1 

5.14 Tryptophan metabolism PATH:hsa00380 
£222TD02,TPH2,TRPO,TDO TRP + 02->FKYN 

8564 j^jq KYN + NADPH + 02 -> HKYN + NADP + H20 

5242 KYNU KYN -> ALA + AN 

HKYN + H20 -> HAN + ALA 
2349B HAAO. HAO, 3-HAO HAN + 02 -> CMUSA 

7166 TPH, TPRH 
438 ASMT, HIOMT, ASMTY 
15 AANAT, SNAT 
3520 INDO, IDO 
^0352 WARS2 
7453 WARS, IFP53, 1FI53, GAMMA-2 
4734 NEDD4, WAA0093 
5 15 Phenylalanine, tyrosine and tryptophan biosynthesis PATH:hsa00400 

5Q53 PAH, PKU1 PHE + THBP + 02 -> TYR + DHBP + H20 

10667 FARS1 
2133 FARSL, CML33 
1QQ5S PheHB 
5555 YARS, TYRRS, YTS, YRS 
5.16 Urea cycle and metabolism of amino groups PATH:hsaQ0220 
5532 PYCS 

GLUP + NADH ■> NAD + Pi + GLUGSAL 
GLUP + NADPH -> NADP + PI + GLUGSAL 



FACA + H20 -> FUM + ACA 



ATPm + TRPm + TRNAm -> AMPm + PPlm + TRPTRNAm 
ATP + TRP + TRNA -> AMP + PPI + TRPTRNA 



1.5.1.2 



1 14.11.2 

61 1.19 

2.7.3.2 

2X32 

2.7.3.2 

2.7.3.2 

27.3.2 

2.5.1.16 

4.1.1.50 

4-1-1-50 

1.5.99.6 

2.5X22. 

4,1.1.17 

2 ,3.1.57 

2X2.5 
4.3.1.4 
4.1.1.22 
4 1.1.28 
2.1.1.8 

1.2.1.5 

1.2.1.5 

1.2.1.5 

1,2.1.5 

6.1.1^1 

2.6.1.5 

1 13-11-27 
1,13.11-5 
52X2 

2 5.1.18 
17.1.2 

1 14-18.1 
1 14.16.2 
1 14-17.1 
2-1 1-28 
21.1.6 
1.11,1.6- 

1-2-1-- 

1.13.11.11 
1 14.13.9 
3.7.1.3 

1 13.11.6 

1 14.16.4 

2.1.1.4 

2.3.1.87 

1-13.11.42 

5X12 

R1-1-2 

522^ 



1-14.16.1 
5X120 
5X120 
6X120 
1-1.1 

?, r 7-2.11 

12X41 
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25 ACY1 
6. Metabolism of Other Amino Adds 
6.1 beta-Alanine metabolism PATH:hsa00410 
62 Taurine and hypotaurine metaboTism PATH:hsa00430 
, GGT1, GTG, D22S672, D22S732, 
! GGT 
2679 GGT2, GGT 
2m GGT3 



2SZS, 



RGT + ALA •> CGLY + ALAGLY 

RGT + ALA -> CGLY + ALAGLY 
RGT +AU-> CGLY + ALAGLY 



2587 GGTLA1 , GGT-REL, DKFZP566O01 1 RGT + ALA -> CGLY + ALAGLY 

6.3 Aminophosphonate metabolism PATH:hsaD0440 

5130 PCYT1A, CTPCT. CT, PCYTf PCHO + CTP -> CDPCHO + PPI 

9791 PTDSS1 , KIAA0Q24, PSSA CDPDG + SER <-> CMP + PS 

6.4 Selenoamino acid metabolism PATH ^sa 004 50 

22928 SPS2 

22929 SPS. SELD 

6.5 Cyanoamino add metabolism PATH:hsa00460 

6.6 D-Glutamine and D-gtutamate metabolism PATH:hsa00471 

6.7 D-Arglnine and D-omithine metabolism PATH:hsa00472 
6.9 Glutathione metabolism PATH:hsa00480 

51fi2 PEPB 
2655 GCTG 
2S26 GPX1.GSHPX1 
2877 GPX2, GSHPX-GI 
2S2fi GPX3 
2829 GPX4 

2880 GPX5 

2881 GPX6 
2338 GSTA1 
2329 GSTA2, GST2 

2940 GSTA3 

2941 GSTA4 
2944 GSTM1, GST1. MU 

2946 GSTM2, GST4 

2947 GSTM3, GST5 

2948 GSTM4 

2949 GSTM5 

2950 GSTP1. FAEES3, DFN7, GST3, PI 

2252 GSTT1 

2253 GSTT2 

4257 MGST1, GST 12, MGST, MGST-I 

425B MGST2, GST2, MGST-I] 

4252 MGST3, GST-Ill 
7. Metabofism of Complex Carbohydrates 
7.1 Starch and sucrose metabofism PATH:hsa00500 

S4ZS SI 



2 RGT + H202 <-> OGT 
2RGT + H202 <-> OGT 
2RGT + H202<->OGT 
2 RGT + H202 <-> OGT 
2RGT + H202<-> OGT 
2RGT + H202<->OGT 



HlfilTREH.TRE.TREA 
299Q GUSB 
2532 GBE1 
5834 PYGB 
5535 PYGL 
£837 PYGM 
229Z GYS1, GYS 
2228 GYS2 

276 AMY1A, AMY1 

277 AMY1B, AMY1 
2ZB AMY1C, AMY1 
222 AMY2A. AMY2 
2SQ AMY2B, AMY2 
178 AGU GOE 

10000 AKT3. PKBG, RAO-GAMMA, PRKBG 

1017 CDK2 

1018 CDK3 

1019 CDK4. PSK-J3 
ifi2Q COK5, PSSALRE 



TRE -> 2 GLC 

GLYCOGEN + PI -> G1P 
GLYCOGEN + PI ->G1P 
GLYCOGEN + PI -> G1P 
GLYCOGEN + PI ->G1P 
UDPG -> UDP + GLYCOGEN 
UDPG •> UDP + GLYCOGEN 



3.5.1.14 



2322. 

2222. 
2222. 



2,7.7.15 

2.7.8.- 



2.7.9.3 
27,9.3 



3.4.11.4 
2.3.2.4 
1.11.1.9 
1.11.1.9 

1.11.1.9 
1.11.1.9 
1.11.1.9 
1.11.1,9 
Z5.1.18 
2.5.1,18 
2.5.1. 18 
2.5.1.18 
2.5.1.18 
2.5.1.18 
2,5.1.16 
2.5.1.18 
2.5.1.18 
2.5.1.18 
2.5.1.18 
2.5.1.18 
2.5.1.1ft 
2.5.1.18 
2.5.1.18 



3,2.1. 10 
3.2.1.48 

3.2.1.28 

3.2.1,31 

2,4,118 

2.4.1.1 

2.4.1,1 

24.1.1. 

2.4.1.11 

2.4.1,11 

3.2.1.1 

3.2.11 

3,2.11 

3^.1.1 

3^.11 

74.1.25 

3JU-33. 
2.7.1- 

2.7.1. - 
2.7.1. - 

2,7.1- 
2.7,1- 
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1£21 CDK6, PLSTIRE 

1022 CDK7, CAK1, STK1, CDKN7 

1024 CDK8, K35 

JJJ25 CDK9, PITALRE, CDC2L4 
10298 PAK4 
10746 MAP3K2, MEKK2 

11H CHEK1, CHK1 
11200 RAD53, CHK2, CDS1, HUCDS1 

1195 CLK1 t CLK 

1326 MAP3K8, COT, EST, ESTF, TPL-2 
MAPK14, CSBP2, CSPB1, PRKM14, 
PRKM15, CSBP1, P3B, MX12 

1452 CSNK1A1 

1453 CSNK1D, HCKID 

1454 CSNK1E, HCWE 

1455 CSNK1G2 

1456 CSNK1G3 
1612 DAPK1, DAPK 

1760 DMPK, DM, DMK, DM1 

1659 DYRK1A, DYRK1, DYRK, MNB, MNBH 

2Q8 AKT2, RAC-BETA, PRKBB, PKBBETA 

269 AMHR2, AMHR 
2Z33Q RPS6KA6, RSK4 

2SSS GPRK2L, GPRK4 

2859 GPRK5, GRK5 

2SZD GPRK6, GRK6 
29904 HSU93B50 
30811 HUNK 

3611 ILK, P59 

3654 IRAKI, IRAK 

369 ARAF1, PKS2, RAFA1 

370 ARAF2P, PKS1, ARAF2 
3984UMK1.UMK 

3935 UMK2 

4117 MAK 

4140 MARK3, KP78 

4215 MAP3K3, MAPKKK3, MEKK3 
MAP3K4, MAPKKK4, MTK1 , MEKK4, 

4216 KIAA0213 

4217 MAP3K5, ASK1, MAPKKK5, MEKK5 

4293 MAP3K9, PRKE1, MLK1 

4294 MAP3K10, MLK2. MST 
4342 MOS 

4751 NEK2, NLK1 

4752 NEKS 

5Q58 PAK1,PAKalpfia 

5(262 PAK2, PAK65. PAKgamma 

5Q63 PAK3, MRX30, PAK3beta 

5127 PCTK1 , PCTGAIRE 

5128 PCTK2 

£129 PCTK3, PCTAIRE 
5292PIM1.PIM 
5242 PLK, PLK1 
5§52 PRKAA1 

5563 PRKAA2, AMPK, PRKAA 
55ZS PRKCA, PKCA 

55Z9 PRKCB1, PKCB, PRKCB, PRKCB2 
55BQ PRKCD 
55B1 PRKCE 

5562 PRKCG, PKCC, PKCG 
5583 PRKCH, PKC-L, PRKCL 

5564 PRKC1, DXS1179E, PKCI 

5565 PRKCL1, PAK1, PRK1, DBK, PKN 
5586 PRKCL2, PRK2 

55BS PRKCQ 
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2,7.1- 
27.1.- 
2,7,1- 
2.7.1.-. 

2.7.1. - 
2.7.1 
2.7.1. - 
2.7.1. - 
2,7,1,-. 
2 7.1- 

2Xlr 

2.7.1 . - 
2.7.1 . - 
27.1.- 
27.1.- 
2.7.1 .- 
2.7.1 . - 
2.7.1 . - 

2.7.1 ,- 

2,7,1,- 

2.7.1. - 
2.7.1 
2.7,1,- 
2.7.1. - 
2.7. 1.- 
2.7.1 .- 
2.7. 1.- 
2.7.1. - 
2.7.1. - 
2.7.1. - 
2.7. 1.- 
2.7.1 .- 
2.7.1.- 
2.7.1- 
2.7.1- 
27.1.- 

2.7. 1.- 

27.1- 
2.7.1 
77.1.- 
7.7.1- 

2,7.1." 
2.7.1. - 

2.7.1. - 
2.7.1. - 
2.7.1 .- 
27.1.- 
27.1.- 
2.7.1.- 
2-7.1.- 
2-7.1.- 
2.7.1 .- 
7.71- 

2.7.1- 
2,7 1 - 
27.1,- 
2,7.1.- 

7.7.1. - 
2JLXz 

, 2X1= 
2JJLc 
2.7.1.- 
2.7.1. - 
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559Q PRKCZ 2 .7.1 c 
MAPK1, PRKM1, P41MAPK, 

£594 P42MAPK, ERK2, ERK, MAPK2, 2.7.1. = 
PRKM2 

MAPK3, ERK1, PRKM3, P44ERK1 , 

52225 P44MAPK ZJJ -=- 

5592 MAPK6, PRKM6, P97MAPK, ERK3 2.7.1- 

5593 MAPK7, BMK1 , ERK5, PRKM7 2.7.1 .- 
MAPKB, JNK, JNK1, SAPK1, PRKMB. 



JNK1A2 



2.7.1- 



™. MAPK9, JNK2, PRKM9, P54ASAPK, 

5501 JUNKINASE 2XLz 

MAPK10, JNK3, PRKM10, P493F12, 
382 P54BSAPK 2XI ^ 

MAPK13, SAPK4, PRKM13, 
5203 P38DELTA 

MAP2K1, MAPKK1. MEK1, MKK1, _ , , 

PRKMK1 

MAP2K2, MEK2, PRKMK2 2.7.1 

5606 MAP2K3, MEK3, MKK3, PRKMK3 2.7.1 - 

5607 MAP2K5, MEK5, PRKMK5 2.7.1- 
( MAP2K6, MEK6, MKK6, SAPKK3, 

PRKMK6 

MAP2K7, MAPKK7, MKK7, PRKMK7, 
JNKK2 



2.7.1 - 
2.7.1 c 

551Q PRKR, EIF2AK1, PKR 2.7.1. - 

5613 PRKX, PKX1 2.7.1- 

5824 RAF1 2.7.1. - 
BCR, CML, PHL, BCR1, D22S11, 

^ D22S662 ?i7 . 1r 

RPS6KA1, HU-1, RSK. RSK1. _ _ . 

MAPKAPK1A 2ZJ ^ 
c . ne RPS6KA2, HU-2, MAPKAPK1C, RSK, 

5135 RSK3 2ZJ ^ 

RPS6KA3, RSK2, HU-2, HU-3, RSK, 

S22S1 MAPKAPK1 B, ISPK-1 2,7,1 r 

6198 RPS6KB1 . STK14A 2.7.1- 

6199 RPS6KB2, P70-BETA, P70S6KB 2.7.1. - 
fi , nn MAPK12, ERK6, PRKM12, SAPK3, _ _ . 

P38GAMMA. SAPK-3 
MAP2K4, JNKK1, MEK4, PRKMK4, 

5415 SERK1 , MKK4 2,7,1 " 

£44£ SGK 2.7.1. - 

£58 BMPR1B, ALX-6, ALK6 * . 2.7.1 - 

£59 BMPR2, BMPR-II, BMPR3, BRK-3 2.7.1- 

£ZS BRAF 2.7.1- 

6792 STK9 2.7. 1.- 

£Za4STK11 t LKB1,PJS ?L7.1. - 



MAP3K7, TAK1 2.7.1- 

£29 BUB1 2.7.1.- 

ZQ1 BUB1B, BUBR1, MAD3L 2.7.1. - 

ZQlfi TESK1 2.7. 1.- 

7222 TTK, MPS1 L1 2.7.1. - 

7867 MAPKAPK3, 3PK, MAPKAP3 2.7.1- 



i ULK1 2.7.1- 

£558 CDK10, PISSLRE Z7.1- 

£521 CDC2L5, CDCZL, CHED 2.7.1- 

£Z2Z R1PK1.RIP 2.7.1- 

££14 CDKL1. KKIALRE ?.7.1- 

££99 PRP4, PR4H 2.7.1- 

9024 MAP3K6. MAPKKK6 2 7.1- 

9149 DYRK1B 2.7.1- 

92 ACVR2, ACTRII 2.7.1- 

2211 DCAMKL1, KIAA0369 2.7.1- 

23 ACVR2B 2.7.1- 

2£3 CDC2 Z7.1- 

2MCDC2L1 2.7.1- 
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DHPP -> DHP + PI 
GTP -> GSN + 3 P! 
DGTP->DG + 3PI 



1Z9B 



4245, 



4241 



72 Glycoprotein biosynthesis PATH:hsaOD510 
. DPAGT1, DPAGT, UGAT, UAGT, 
1 D11S366, DGPT, DPAGT2, GPT 
228SQ ALG5 
8813 DPMI 

1650 DDOST, OST, OST48, KIAA0115 
6184 RPN1 
£125 RPM2 
10130 P5 
10954 PDIR 
11008 PDI 

GRP58, ERp57, ERp60, ERp61, 
2923 GRP5A P58, PI-PLC, ERP57, ERP60, 
ERP61 

5034 P4HB, PROHB, P04DB, ERBA2L 
7B41 GCS1 

4121 MAN1A1, MAN9, HUMM9 
. MGAT1, GLYT1, GLCNAC-Tl, GhfT-l, 
2 MGAT 

4122 MAN2A2, MANA2X 
4124 MAN2A1, MANA2 

r MGAT2, CDGS2, GNT-II, GLCNACTll. 
" GNT2 
4248 MGAT3, GNT-M 
SIAT6, ST3GAUI 
fi4SQSlAT1 

2329 FNTA, FPTA, PGGT1A 
2242 FNTB, FPTB 
5229 PGGT1B, BGGl, GGTI 
58Z5RABGGTA 
5 876 RABGGTB 
1352 COX10 

7.3 Glycoprotein degradation PATH:hsa0051 1 

4758 NEU1, NEU 
3QI3 HEXA, TSD 
3074 HEXB 

4123 MAN2C1, MANA, MANA1 , MAN6A8 

4125 MAN2B1, MANB, LAMAN 

4126 MANBA, MANB1 
2512 FUCA1 
2519FUCA2 ■ 

175 AGA, AGU 

7.4 Aminosugars metabolism PATH:hsa00530 

fiSZS UAP1, SPAG2, AGX1 
1QQ2Q GNE, GLCNE 
22951 CMAS 
1727 DIA1 
4669 NAGLU, NAG 
73 Upopolysaccharide biosynthesis PATH:hsaO0540 
j>485 S1AT5, SAT3, STZ 
7903 S1AT6D, PST, PST1, ST8SIArlV 
p128 SIAT8B, STX, ST8SIA-I! 
7.7 Glycosamlnoglycan degradation PATH:hsa0O531 
?423 IDS, MPS2, SIDS 
3425 IDUA. IDA 
411 ARSB 
7799 GNS, G6S 

7588 GALNS, MPS4A, GALNAC6S, GAS 
8. Metabolism of Complex Uplds 
8.1 Glycerofipld metabolism PATHAsaO0561 



GDPMAN + DOLP -> GDP + DOLMANP 



UTP + NAGA1P <-> UDPNAG + PPJ 



PCT/US03/09751 



.3,6,1- 



2.7.8.15 
2.4,1.117 

7.4.1.83 
7.4,1,119 
2.4,1.119 
2.4.1. 119 . 

5,3.4.1 
fl.3.4.1 
fi.3.4.1 



5.3.4.1 

fi.3.4.1 
3.2.1. 106_ 
32.1.113 

2.4.1.101 

3 2.1.114 
3-2.1.114 

2 4.1.143 

24.1.144 

2.4.99.6 

2.4.99.1 

2,5.1.- 

2.5.1.- 

7 5.1- 

2,5,1.- 

2.5.1 

2.5.1,- 

3-2.1.18 
3.2.1.52 

32X52. 
3.2.1.24 

3.2.1.24 

3.2.1.51 
3 2.1.51 
3^L26_ 

27.7.23 
5 1.3.14 
77.7.43 

37.130 

74.99.- 
7 4.99.- 
7 4.99.- 

3J.6.13 

3 7.1.76 

3X&12- 

31.6.14 
31-6.4 



1Q554 AGPAT1, LPAAT-ALPHA, G15 



AGL3P + 0 017 C100ACP + 0.062 C120ACP + 0.100 C14QACP ■ 
0570 C160ACP + 0.169 C161ACP + 0.055 C180ACP + 0535 
C181ACP + 0.093 C182ACP -> PA + ACP 



2.3-1.51 
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AGL3P + 0.017 C100ACP + 0.062 C120ACP +0.100 C140ACP + 



ir>^«?^ Af^DATO I DAAT..P.PTA 
iUaga AV3r A 1 LrAA I H3C I A 




2 3 1 51 




C181ACP + 0 Q93 C182ACP -> PA + ACP 




lOUO UbNA, UAot^ UAbM 




2 7 1 1 07 


4czc\Q r\nim r\ a nw 
IQUo LHslNAa, UAofvS 




2 7 1 107 


JfiQa DoKU, DAGK4 




9 7 1 107 


croc r\j~±i/~7 r\Anifc unfirrcrA 




9 7 1 107 


croc n^t^c f» A f^tsc pk/^i/ - 




9 7 1 107 


P?g7 DGKD, DGKDo-XA, K.IAAU 14b 




9 7 1 107 


1120 CHHX 


Alr + OMU -> AUr + rutlU 


9 7 1 V> 


EKJ1 


ATP + cTnM -> ADP + Pel MM 




1119 CHK, CKJ 


ATP + CHO -> ADP + rCMU 




42 ACHE, YT 




3J.1..7. 


1103 CHAT 




2.3.1.6 


5322 PLD1 




3.1 .4.4 


26279 PLA2G2D, SPLA2S 




3.1.1.4 


30814 PLA2G2E 




3.1.1.4 


5319 PLA2G1B, PLA2, PLA2A, PPLA2 




3.1.1.4 


5320 PLA2G2A, M0M1, PLA2B, PLA2L 




3.1 .1.4 


5322 PLA2G5 




3.1.1.4 


8398 PLA2G6, IPLA2 




3.1.1.4 


R399 PLA2G10 SPLA2 




3.1.1.4 


1040 CDS1 


PA.+ CTP <-> CDPDG + PP1 


2.7.7.41 


10423 PIS 


CDPDG + MYOI -> CMP + PINS 


2.7.8.11 


2710 GK 


GL + ATP->GL3P + ADP 


2.7.1.30 


2fi2Q GPD2 


GL3Pm + FADm ■> T3P2m + FAD H 2m 


1.1.99.5 


2819 GPD1 


T3P2 + NADH <-> GL3P + NAD 


1.1.1.8 


248 ALPI 


AHTD -> DHP + 3 PI 


3.1.3.1 


249 ALPL, HOPS, TNSALP 


AHTD -> DHP + 3 PI 


3.1.3.1 


25Q ALPP 


AHTD -> DHP + 3 PI 


3.1.3.1 


251 ALPPL2 


AHTD -> DHP + 3 PI 


3.1.3.1 


439 ASNA1 , ARSA-I 




3-Q,1-16 




DAGLY + 0.017 C100ACP + 0.062 C120ACP + 0.100 C140ACP 




B694 DGAT, ARGP1 


+ 0.270 C160ACP + 0.169 C161ACP + 0.055 C180ACP + 0235 


2-3X20. 




C161ACP + 0.093 C182ACP -> TAGLY + ACP 




33B9UPB 




3.1.1.3 


2920 UPC, HL 




3.1.1.3 


5406 PNLIP 




3.1.1.3 


5407 PNUPRP1, PLRP1 




3.1.1.3 


5408 PNUPRP2, PLRP2 




3.1.1.3 


8513 UPF, HGL, HLAL 




3.1.1.3 


4023 LPUUPD 




3.1.1.34 


B443 GNPAT, DHAPAT, DAP-AT 




2.3.1.42 



8540 



AGPS, ADAP-S,- ADAS, ADHAPS. 



ADPS, ALDHPSY 
4JS5 MDCR, MDS, LIS1 

5048 PAFAH1B1, US1, MDCR, PAFAH 

5049 PAFAH1B2 
5D50 PAFAH1B3 
5051 PAFAH2, HSD-PLA2 
7941 PLA2G7, PAFAH, LDL-PLA2 

82 Inositol phosphate metabolism PATH±isa00562 



2,5.1,2,6 

3.1.1.47 
3.1.1.47 
3.1.1.47 
3.1.1.47 
3.1.1.47 
3.1.1.47 



522QPIK3CA 

5291 PIK3CB, PIK3C1 

5233 PIK3CD 

5294 PIK3CG 

5297 PIK4CA. PWK-ALPHA 

5305PIP5K2A 

5330 PLCB2 

5331PLCB3 

5232 PLCD1 

5235 PLCG1, PLC1 

5335PLCG2 

3512 IMPA1, IMPA 

3513IMPA2 

2523 INPP1 



ATP + PINS -> ADP + PINSP 
ATP + PINS -> ADP + PINSP 
ATP + PINS ~> ADP + PINSP 
ATP + PINS -> ADP + PINSP 
ATP + PINS -> ADP + PINS4P 
P1NS4P + ATP -> D45P1 + ADP 
D45PI -> TP! + DAGLY 
D45PI -> TPI + DAGLY 
D45PI -> TPI + DAGLY 
D45PI -> TPI + DAGLY 
D45PI -> TPI + DAGLY 
MT1P->MY0I + PI 
MI1P->MY0I + PI 



2.7.1.137 

2.7.1.137 

2.7.1.137 

2.7.1.137 

2.7.1.67 

2.7.1. 68 

3.1.4.11 

3.1.4.11 

3.1.4.11 

3.1.4.11 

3.1.4.11 

3.1,3,25 

3.1.3.25 

3.1^^7 
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2622 INPP5A 
2£23 1NPP5B 
2S2S INPPL1, SHIP2 

OCRL, LOCR, OCRL1, INPP5F 
gB67 SYNJ1, INPP5G 
370S rfPKA 

51477 1SYNA1 G6P->M11P 

2521 INPP4A, INPP4 
8821 INPP4B 

8.3 SphingophosphoBpid biosynthesis PATH:hsa00570 

5609 SMPD1, NPD 

8.4 Phospholipid degradation PATH:hsa00580 

1178 CLC 

5221 PLA2G4A, CPLA2-ALPHA, PLA2G4 

8.5 Sphingoglycoliptd metaborism PATH:hsaOO600 

10 558 SPTLC1,LCB1,SPTI PALCOA + SER •> COA + DHSPH + C02 

9517 SPTLC2, KIAA0526. LCB2 PALCOA + SER->COA + DHSPH + C02 

427 ASAH, AC, PHP32 
7357 UGCG, GCS 

2522 GBA, GLUC 
25B3 GALGT, GALNACT 
6489 SIAT8A, SIATB, ST8SIA-I 
£481 SIAT2 

466B NAGA, D22S674, GALB 
9514 CST 
41 Q ARSA, MLD 

8.6 Blood group glycolipid biosynthesis - lact series PATH:hsa0060l 

28 ABO 

2525 FUT3, LE 
252Z FUT5, FUC-TV 
2528 FUT6 
2522 FUT1, H. HH 
2524 FUT2, SE 

8.7 Blood group glycolipid biosynthesis - neotact series PATH:hsa00602 

26S1 GCNT2, IGNT, NACGT1, NAGCT1 

8.8 Prostaglandin and leukotriene metabofism PATH:hsa00590 

222 ALOX12, LOG12 

246 ALOX15 

240 ALOX5 
4D56 LTC4S 
4048 LTA4H 

4051 CYP4F3, CYP4F, LTB4H 
£529 CYP4F2 

5742 PTGS1, PGHS-1 • 

5743 PTGS2, COX-2, COX2 
223QB PGDS 

5730 PTGDS 
5740 PTGIS, CYP8, PGIS 
6916 TBXAS1, CYP5 
BZSCBR1.CBR 



874 CBR3 
9. Metabofism of Cofactors and Vitamins 
9.2 Riboflavin metabolism PATH:hsa00740 

52 ACP1 

FMN -> RIBOFLAV + PI 

53 A CP2 FMN -> RIBOFLAV + PI 

54 ACP5, TRAP FMN -> RIBOFLAV + PI 

55 ACPP, PAP FMN -> RIBOFLAV + PI 
9 3 Vitamin B6 metabolism PATHiisa00750 

S56BPDXK,PKH,PNK PYRDX + ATP -> P5P + ADP 

PDLA + ATP -> PDLA5P + ADP 
PL + ATP-> PL5P+ADP 
9 4 Nicotinate and nicotinamide metabolism PATH:hsa00760 

22415. QPRT OA + PRPP ■> NAMN + C02 + PPI 



3.1.3,56 

3.1.3.56 

3-1-3.56 

3.1.3.56 

2.7.1 .12L 

5.5.1.4 

3.1.3.66 

3.1.3.66 

3,1,4,12. 

3.1.1.5 
3.1.1.5 

2r3.1,5Q. 

2.3.1.50 
2L5A22. 
2.4.1.80 
3.2.1.45 

2.4.1.92 
24.99.B 
2.4.99.2 
3.2.1.49 

3.1.6.8 

2,4.1.40 
2,4.1.37 
2,4.1,65 
2,4,1.65 

2.4.1.65 

2.4.1 .69 

2.4.1.69 

2,4.1,150 

1-13.11.31 
1.13.11.33 
1.33.11.34 

251-37 

1-1413.30 
1-14.13.30 
1.14,99.1 
1-14.99.1 

5.3.99.2 

5.3.99.4 

5.3.99.5 

1.1.1.184 

1-1.1.189 

1.1.1.197 

.1.1.1.184. 



3.1.3.48 
3.1.3,2 
3.1.3.2 
3.1.3.2 

2.7.1.35 



2.4.2.19 
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4837 NNMT 
£63 BST1,CD157 
252 CD38 
2353Q NNT 

9.5 Pantothenate and CoA biosynthesis PATH:hsa00770 

9.6 Btotin metabolism PATH:hsa0078Q 

2141 HLCS, HCS 



NAD -> NAM + ADPR1B 
HAD -> NAM + ADPRIB 



GTP -> FOR + AHTD 
DHF + NADPH -> NADP + THF 
THF + ATP + GLU <-> ADP + PI + THFG 



NADPH + DHBP -> NADP + THBP 



ATP + FTHF •> ADP + PI + MTHF 



fififiBTD 

9.7 Folate biosynthesis PATH:hsa00790 

2643 GCH1 , DYT5, GCH, GTPCH1 
1719 DHFR 
2356 FPGS 
££36 GGH, GH 
5805 PTS 
6697 SPR 

5660 QDPR, DHPR, PKU2 

9.8 One carbon pool by folate PATH:hsa00670 
1DB40 FTHFD 
10588 MTHFS 

9.10 Porphyrin and chlorophytl metabolism PATH±isa00860 

210 ALAD 2 ALAV -> PBG 

3145 HMBS, PBGD, UPS 4 PBG -> HMB + 4 NH3 

7390 UROS HMB -> UPRG 

23fiS UROD UPRG -> 4 C02 + CPP 

1371 CPO. CPX 02 + CPP -> 2 C02 + PPHG 

5498 PPOX, PPO 02 + PPHGm -> PPDCm 

2235 FECH, FCE PPIXm ^> PTHm 

3162HMOX1.HO-1 
£163 HM0X2, HO-2 

644 BLVRA, BLVR 

645 BLVRB, FLR 

2222 FDXR, ADXR 
3052 HCCS, CCHL 
1356 CP 

9.11 Ubiquinone biosynthesis PATH ^sa 00 130 
4238 OAS1, IFM, OlAS 

4939 OAS2, P69 

5557 PRIM1 

5558 PRIM2A, PRIM2 

5559 PRIM2B, PRIM2 

7015 TERT, EST2, TCS1. TP2, TRT 
fiSSfi OASU TRIP14 

10. Metabofism of Other Substances 

10.1 Terpenoid biosynthesis PATH±tsa0O9O0 

10.2 Flavonoids, stiibene and Bgnin biosynthesis PATH:hsa00940 

10.3 Alkaloid btosynffiests I PATHiisa00950 

10.4 Alkaloid biosynthesis II PATH:hsa00950 

10.6 Streptomycin biosynthesis PATH:hsa00521 

10.7 Erythromycin biosynthesis PATH:hsa00522 

10.8 Tetracycline fcosynthesls PATHmsa00253 

10.14 gamma^Sexachtorocyclohexans degradation PATH:hsa00361 

5444 PON1.ESA, PON 

5445 PON2 

10.18 1,2-Dichloroethane degradation PATH:hsa 00631 

10.20 Tetrachtoroethene degradation PATHJtsa00625 
2Q52 EPHX1, EPHX, MEH 

2053 EPHX2 

10.21 Styrene degradation PATH:hsaOOS43 

1 1 . TranscripSon (condensed) 

11.1 RNA polymerase PATH:hsa03020 



2.1.1.1 

3.2.2.5 
1.6.1.2 



6.3.4,- 
6.3.4.9 
6.3.4.1Q 
6,3.4.11 

6.3.4.15 
3.5.1.12 



3.5.4.16 

1.5.1.3 

63.2.17 

3.4.19.9 

4.6.1.10 

1.1.1.153 

1.6.99.7 

1.5.1.6 

4.2.1.24 

4.3.1.8 

4,2.1.75 

4.1.1.37 

■1.33.3 

1.3.3.4 

4.99.1.1 

1.14.39.3 

1.14.99.3 

1.3.1,24 

1.3.1.24 

1.6.99.1 

1.18.1.2 

4.4.1,17 

1.16.3.1 

2.7.7.. 
2.7.7.- 
2.7.7.- 
2.7.7- 
2JJU 
2.7.7.- 
2.7.7.- 



3.1.6.1 
3.1.1.2 
3JLL2 
3J.8-1 



2&23L 
3JL2*3_ 
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1 1 2. Transcription factors PATH:hsa03022 

12. Translation (condensed) 

12.1 Ribosome PATH:hsa03010 

12J2 Translation factors PATH:hsa0301 2 



1915 



EEF1A1, EF1A. ALPHA, EEF-1, 



1 EEF1A 
1917 EEF1A2, EF1A 
1938 EEF2, EF2, EEF-2 

12.3 AminoacyURNA biosynthesis PATH:hsa00970 
13. Sorting and Degradation (condensed) 
13.1 Protein export PATH:hsa03060 

2MZ0-SPC18 

13.4 Proteasome PATH:hsa03050 

5687 PSMA6, IOTA, PROS27 
5633 PSMA2, HC3, MU, PMSA2, PSC2 
56S5 PSMA4, HC9 

5688 PSMA7, XAPC7 
5££5 PSMA5, ZETA, PSC5 
5682 PSMA1, HC2, NU, PROS30 
5684 PSMA3, HC8 
5698 PSMB9, LMP2, RING12 
5625PSMB7.Z 
5691 PSMB3, HC1MI 
569Q PSMB2. HC7-1 
5622 PSMB5. LMPX, MB1 

5689 PSMB1, HC5, PMSB1 
5532 PSMB4, HN3, PROS26 

14. Replication and Repair 

14.1 DNA polymerase PATH:hsa03030 

14.2 Replication Complex PATH:hsa03032 
23526 SP011 

7153 TOP2A, TOP2 
7155 TOP2B 
Z155TOP3A,TOP3 
8940 TOP3B 

22. Enzyme Complex 

22.1 Bectron Transport System, Complex I PATH:hsa03100 

22.2 Bectron Transport System, Complex I! PATH:hsa03150 

22.3 Bectron Transport System, Complex ill PATH:hsa03140 

22.4 Electron Transport System, Complex IV PATH:hsa03130 

22.5 ATP Synthase PATH:hsa03110 
22.8 ATPases PATH:hsa03230 

23. Unassigned 

23.1 Enzymes 
5538 PPT1, CLN1, PPT, INCL 

23.2 Non-enzymes 
22334 RPIA, RPi 

525QSLC25A3, PHC 

55Z6 
51156 LOC51 166 

5625 PRODH 

6517 SLC2A4, GLUT4 

6513 SLC2A1, GLUT1, GLUT 
26275 HIBCH, HIBYL-COA-H 
23305 K1AA0837, ACS2, LACS5, LACS2 

8611 PPAP2A, PAP-2A 

8612 PPAP2C, PAP-2C 
8613PPAP2B, PAP-2B 

56994 LOC56994 
10400 PEMT.PEMT2 

5833 PCYT2, ET 
10390 CEPT1 

8394 P1P5K1A 

8395 PIP5K1B, STM7, MSS4 
6395 PIP5K2B _ • " ■■ 

2339B PIP5K1C, KIAA0589, PIP5K-GAMMA PINS4P + ATP -> D45PI + ADP 
24. Our own reactions which need to be found in KEGG 



3.6.1 .48. 

3.6.1 .46_ 
3.6.1.48 



3 4.21.89 

3-4.99.46 
3-4.99.46 
34.99.46 
3-4.99.46 

3 , 499.46 
3.4.99.46 
3„4.99.46- 

34.99.46 
34.99.46 
3.4.99.46 
3 4.99.46 
34.99.46 



5.99.1.3 

5i 99-1 -3 
5.99.1.3 
5.99.1.2 
5.99.1.2. 



C160ACP + H20 -> C160 + ACP 

RL5P<->R5P 
PI + H <-> Hm + Pirn 
CIT + MALm <-> CITm + MAL 
AADP + AKG -> GLU + KADP 
PRO + FAD -> P5C + FADH2 
GLCxt->GLC 
GLCxt->GLC 

HIBCOAm + H20m -> HIBm + COAm 
C160 + COA + ATP -> AMP + PPI + C160COA 
PA + H2O->0AGLY + PI 
PA+H20->DAGLY + PI 
PA + H20->DAGLY + PI 
COPCHO + DAGLY -> PC + CMP 
SAM + PE ■> SAH + PMME 
PETHM + CTP-> COPETN + PPI 
CDPETN + DAGLY <-> CMP + PE 
PINS4P + ATP -> D45P) + ADP 
PINS4P + ATP -> D45PI + ADP 
PINS4P + ATP -> D45PI + ADP 



3.1.2.22 
5,2*16 



2 6.1 .39 
.15,3.- 



3.1.2.4 
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amino acid metabolism 



GL3P <-> GL3Pm 
T3P2 <-> T3P2m 
PYR <-> PYRm + Hm 

ADP + ATPm + PI + H -> Hm + ADPm + ATP + Pirn 

AKG + MALm <-> AKGm + MAL 

ASPm + GLU + H -> Hm + GLUm + ASP 

GDP + GTPm + PI + H -> Hm + GDPm + GTP + Pirn 

C160Axt + FABP -> C160FP + ALBxt 

C160FP->C160 + FABP 

C1 80Axt + FABP -> C1 80FP + ALBxt 

C18DFP->C180 + FABP 

C161Axt + FABP -> C161FP + ALBxt 

C161FP->C161 + FABP 

C181Axt + FABP -> C181FP + ALBxt 

C181FP->C181 + FABP 

C182Axt + FABP -> C182FP + ALBxt 

C182FP->C182 + FABP 

C204Axt +'FABP -> C204FP + ALBxt 

C204FP -> C204 + FABP 

02xt->02 

02 <-> Q2m 

ACTACm + SUCCOAm -> SUCCm + AACCOAm 
3HB -> 3HBm 

MGCOAm + H20m -> H3MCOAm 4.2.1.18 

OMVAL -> OMVALm 

OIVAL->OIVALm 

OICAP -> OICAPm 

C160CAR <-> C160CARm 

CAR<->CARm 

DMMCOAm -> LMMCOAm 5.1 .99.1 

THR -> NH3 + K20 + OBUT 42 . 1.16 

THR + NAD -> CQ2 + NADH + AMA 1.T.1.1Q3 
THR + NAD + COA -> NADH + ACCOA + GLY 

AASA + NAD -> NADH + AADP 1.2.1.31 

FKYN + H20 -> FOR + KYN i5X9 

CMUSA -> C02 + AM6SA 4.1 . 1.45 

AM6SA + NAD -> AMU CO + NADH 1.2.1.32 

AMUCO + NADPH -> KADP + NADP + NH4 1.5.1- 
CYSS + AKG c-> GLU + SPYR 

URO + H20-> 4I5P 4.2.1.49 
4I5P + H20->F1GLU 
GLU <-> GLUm + Hm 
. ORN + Hm -> ORNm 
ORN + Hm + CfTRm <-> CitR + ORNm 
GLU + ATP + NADPH -> NADP + ADP + PI + GLUGSAL 
GLYAm + ATPm -> ADPm + 2PGm 



AMBSA->PIC 

SPYR + H20 -> H2S03 + PYR 
P5C <•> GLUGSAL 

fatty acid syntHssis 

MALCOA + ACP <-> MALACP + COA 2.3.1.39 
ACCOA + ACP <~> ACACP + COA 

ACACP + 4 MALACP + 8 NADPH -> 8 NADP + C100ACP + 4 
C02 + 4ACP 

ACACP + 5 MALACP + 10 NADPH -> 10 NADP + C12QACP + 5 
C02 + 5ACP 

ACACP + 6 MALACP + 12 NADPH -> 12 NADP + C140ACP + 6 
C02 + 6ACP 

ACACP + 6 MALACP + 11 NADPH -> 11 NADP + C 141 ACP + 6 
C02 + 6 ACP 

ACACP + 7 MALACP + 14 NADPH -> 14 NADP + C160ACP + 7 
C02 + 7ACP 

ACACP + 7 MALACP + 13 NADPH -> 13 NADP + C161ACP + 7 
C02 +7 ACP 
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fatty acid degradation 



Phospholipid metabolism 



Muscle contraction 



94 

ACACP + 8 MALACP + 16 NADPH -> 16 NADP + C180ACP + 8 
C02 + 8ACP 

ACACP + 8 MALACP + 15 NADPH -> 15 NADP + C181ACP + 8 
C02 + 8ACP 

ACACP + 8 MALACP + 14 NADPH -> 14 NADP + C182ACP + 8 
C02 + 8ACP 

C160COA + CAR •> C1 60CAR + COA 
C160CARm + COAm -> C160COAm + CARm 

GUP + 0.017 C100ACP + 0.062 C120ACP + 0.1 C140ACP + 
0.27 C160ACP + 0.169 C161ACP + 0.055 C160ACP + 0.235 
C181 ACP + 0.093 C1 B2ACP -> AGL3P + ACP 
TAGLYm + 3 H20m -> GLm + 3 C160m 

SAM + PMME -> SAH + PDME 
PDME + SAM -> PC + SAH 
PE + SER<->PS + ETHM 

MYOACT + ATP -> MYOATP + ACTIN 

MYOATP + ACTIN -> MYOADPAC 

MYOADPAC -> ADP + PI + MYOACT + CONTRACT 



WO 03/082214 



PCT/US03/09751 



95 

Table 2 

// Homo Sapiens Core Metabolic Network // 
i 

// Glycolysis // 

-1 GLC -1 ATP +1 G6P +1 ADP 0 HK1 
-1 G6P -1 H20 +1 GLC +1 PI 0 G6PC 
-1 G6P +1 F6P 0 GPIR 
-1 F6P -1 ATP +1 FDP +1 ADP 0 PFKL 
-1 FDP -1 H20 +1 F6P +1 PI 0 FBP1 
-1 FDP +1 T3P2 +1 T3P1 0 ALDOAR 
-1 T3P2 +1 T3P1 0 TPI1R 

-1 T3P1 -1 PI -1 NAD +1 NADH +1 13PDG 0 GAPDR 

-1 13PDG -1 ADP +1 3PG +1 ATP 0 PGK1R 

-1 13PDG +1 23PDG 0 PGAM1 

-1 23PDG -1 H20 +1 3PG +1 PI 0 PGAM2 

-1 3PG +1 2PG 0 PGAM3R 

-1 2PG +1 PEP +1 H20 0 ENOIR 

-1 PEP -1 ADP +1 PYR +1 ATP 0 PKLR 

-1 PYRm -1 COAm -1 NADm +1 NADHm +1 C02m +1 ACCOAm 0 PDHA1 
-1 NAD -1 LAC +1 PYR +1 NADH 0 LDHAR 
-1 G1P +1 G6P 0 PGM1R 

// TCA // 

-1 ACCOAm -1 OAm -1 H20m +1 COAm +1 CITm 0 CS 
-1 CIT +1 ICIT 0 ACOIR 
-1 CITm +1 ICITm 0 AC02R 

-1 ICIT -1 NADP +1 NADPH +1 C02 +1 AKG 0 IDH1 

-1 ICITm -1 NADPm +1 NADPHm +1 C02m +1 AKGm 0 IDH2 

-1 ICITm -1 NADm +1 C02m +1 NADHm +1 AKGm 0 IDH3A 

-1 AKGm -1 NADm -1 COAm +1 C02m +1 NADHm +1 SUCCOAm 0 OGDH 

-1 GTPm -1 SUCCm -1 COAm +1 GDPm +l'PIm +1 SUCCOAm 0 SUCLG1R 

-1 ATPm -1 SUCCm -1 COAm +1 ADPm +1 Pirn +1 SUCCOAm 0 SUCLA2R 

-1 FUMm -1 H20m +1 MALm 0 FHR 

-1 MAL -1 NAD +1 NADH +1 OA 0 MDH1R 

-1 MALm -1 NADm +1 NADHm +1 OAm . 0 MDH2R 

-1 PYRm -1 ATPm -1 C02m +1 ADPm +1 OAm +1 Pirn 0 PC 

-1 OA -1 GTP +1 PEP +1 GDP +1 C02 0 PCK1 

-1 OAm -1 GTPm +1 PEPm +1 GDPm +1 C02m 0 PCK2 

-1 ATP -1 CIT -1 COA -1 H20 +1 ADP +1 PI +1 ACCOA +1 OA 0 

ACLY 
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// PPP // 

-1 G6P -1 NADP +1 D6PGL +1 NADPH 
-1 D6PGL -1 H20 +1 D6PGC 0 PGLS 
-1 D6PGC -1 NADP +1 NADPH +1 C02 +1 RL5P 
-1 RL5P +1 X5P 0 RPER 
-1 R5P -1 X5P +1 T3P1 +1 S7P 
-1 X5P -1 E4P +1 F6P +1 T3P1 
-1 T3P1 -1 S7P +1 E4P +1 F6P 
-1 RL5P +1 R5P 0 RPIAR 



0 G6PDR 



0 TKT1R 
0 TKT2R 
0 TALD01R 



0 PGD 



// Glycogen // 

-1 G1P -1 UTP +1 UDPG +1 PPI 0 OGP1 
-1 UDPG +1 UDP +1 GLYCOGEN 0 GYS1 
-1 GLYCOGEN -1 PI +1 G1P 0 GBE1 



// ETS // 

-1 MALm -1 NADPm +1 C02m +1 NADPHm +1 PYRm 0 ME 3 

-1 MALm -1 NADm +1 C02m +1 NADHm +1 PYRm 0 ME2 

-1 MAL -1 NADP +1 C02 +1 NADPH +1 PYR 0 ME1 

-1 NADHm -1 Qm -4 Km +1 QH2m +1 NADm +4 H 0 MTNDl 

-1 SUCCm -1 FADm +1 FUMm +1 FADH2m 0 SDHC1R 

-1 FADH2m -1 Qm +1 FADm +1 QH2m 0 SDHC2R 

-1 02m -4 FEROm -4 Hm +4 FERIm +2 H20m +4 H 0 DQCRFS1 

-1 QH2m -2 FERIm -4 Hm +1 Qm +2 FEROm +4 H 0 COX5BL4 

-1 ADPm -1 PIm -3 H +1 ATPm +3 Hm +1 H20m 0 MTAT 

-1 ADP -1 ATPm -1 PI -1 H +1 Hm +1 ADPm +1 ATP +1 PIm 

-1 GDP -1 GTPm -1 PI -1 H +1 Hm +1 GDPm +1 GTP +1 PIm 

-1 PPI +2 PI 0 PP 



0 AT PMC 
0 GTPMC 



-1 ACCOA -1 ATP -1 C02 +1 MALCOA +1 ADP +1 PI 0 ACACAR 
-1 GDP -1 ATP +1 GTP +1 ADP 0 GOT3R 



// Transporters // 

-1 CIT -1 MALm +1 CITm +1 MAL 0 CITMCR 
-1 PYR -1 H +1 PYRm +1 Hm 0 PYRMCR 

// Glycerol Phosphate Shuttle // 

-1 GL3Pm -1 FADm +1 T3P2m +1 FADH2m 0 GPD2 

-1 T3P2 -1 NADH +1 GL3P +1 NAD 0 GPDl 

-1 GL3P +1 GL3Pm 0 GL3PMCR 

-1 T3P2 +1 T3P2m 0 T3P2MCR 

// Malate/Aspartate Shuttle // 

-1 OAm -1 GLUm +1 ASPm +1 AKGm 0 G0T1R 

-1 ASP -1 AKG +1 OA +1 GLU 0 GOT2R 

-1 AKG -1 MALm +1 AKGm +1 MAL 0 MALMCR 

-1 ASPm -1 GLU -1 H +1 Hm +1 GLUm +1 ASP 0 ASPMC 
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// Exchange Fluxes // 

+1 GLC 0 GLCexR 

+1 PYR 0 PYRexR 

+1 C02 0 C02exR 

+1 02 0 02exR 

+1 PI 0 PlexR 

+1 H20 0 H20exR 

+1 LAC 0 LACexR 

+1. C02m 0 C02min 
-1 C02m 0 C02mout 
+1 02m 0 02min 
-1 02m 0 02mout 
+1 H20m 0 H20min 
-1 H20m 0 H20mout 
+1 Pirn 0 PImin 
-1 PIm 0 PImout 



// Output // 

-1 ATP +1 ADP +1 PI 0 Output 
0.0 end 
end E 0 
max 

1 Output 
0 end 

0 GLCexR 1 
-1000- PYRexR 0 
-1000 LACexR 0 

0 end 0 
rev. rxn 33 
nonrev. xxn 31 
total rxn 64 
matrix columns 97 
unique enzymes 52 
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Table 3 

Abbrev. Reaction 
Glycolysis 
HK1 

GSPC, G5PT 
GPI 
PFKL 

FBP1.FBP 
ALDOA 
TP11 

GAPD, GAPDH 
PGK1, PGKA 
PGAM1, PGAMA 



EN01, PPH. EN01L1 
PKLR, PK1 

PDHA1,PHE1A,PDHA 

LDHA. LDH1 

PGM1 

TCA 

CS 

AC01.IREB1.IRP1 

AC02 

IDH1 

IDH2 

IDH3A 

OGDH 

SUCLG1 , SUCLA1 

SUCLA2 

FH 

MDH1 
MDH2 
PC, PCB 

ACLY, ATPCL, CLATP 

PCK1 

PPP 

G6PD. G6PD1 

PGLS, 6PGL 

PGD 

RPE 

TKT 

TALD01 
UGP1 

ACACA, ACAC, ACC 

ETS 

ME3 

MTND1 

SDHC 



UQCRFS1, RIS1 
COX5BL4 
MTATP6 
PP, SID6-8061 
Maiate Aspartate shunttle 
GOT1 
GOT2 



GLC + ATP -> G6P + ADP 
G6P + H20->GLC + PI 
G6P <~> F6P 
F6P + ATP -> FDP + ADP 
FDP + H20->F6P+Pl 
FDP<->T3P2 + T3P1 
T3P2<->T3P1 

T3P1 + PI + NAD <-> NADH + 13PDG 
13PDG + ADP <-> 3PG + ATP 
13PDG<->23PDG 
23PDG + H20->3PG + PI 
3PG <-> 2PG 
2PG<->PEP + H20 
PEP + ADP -> PYR + ATP . 

PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 
NAD + LAC «-> PYR + NADH 
G1P <-> G6P 

ACCOAm + OAm + H20m -> COAm + CITm 

CIT <-> IC!T 

CITm <-> ICITm . 

ICIT + NADP -> NADPH + C02 + AKG 

ICITm + NADPm -> NADPHm + C02m + AKGm 

ICITm + NADm -> C02m + NADHm + AKGm 

AKGm + NADm + COAm -> C02m + NADHm + SUCCOAm 

GTPm + SUCCm + COAm <-> GDPm + Plm + SUCCOAm 

ATPm + SUCCm + COAm <-> ADPm + Plm + SUCCOAm 

FUMm + H20m <-> MALm 

MAL + NAD <-> NADH + OA 

MALm + NADm <-> NADHm + OAm 

PYRm + ATPm + C02m -> ADPm + OAm + Plm 

ATP + CIT + COA + H20 -> ADP + PI + ACCOA + OA 

OA + GTP -> PEP + GDP + C02 

G6P + NADP <-> D6PGL + NADPH 

D6PGL + H20 -> D6PGC 

DSPGC + NADP r> NADPH + C02 + RL5P 

RL5P <-> X5P 

R5P + X5P<^>T3P1 +S7P 

X5P + E4P <-> F6P + T3P1 

T3P1 + S7P <-> E4P + F6P 

G1P + UTP->UDPG + PPI 

ACCOA + ATP + C02 <-> MALCOA + ADP + PI + H 

MALm + NADPm -> C02m + NADPHm + PYRm 
NADHm + Qm + 4 Hm •> QH2m + NADm + 4 H 
SUCCm + FADm <-> FUMm + FADH2m 
FADH2m + Qm <-> FADm + QH2m 
02m + 4 FEROm + 4 Hm -> 4 FERlm + 2 H20m + 4 H 
QH2m + 2 FERlm + 4 Hm -> Qm + 2 FEROm + 4 H 
ADPm + P»m + 3H->ATPm + 3Hm + H20m 

PPl->2PI 



OAm + GLUm <-> ASPm + AKGm 
OA + GLU <-> ASP + AKG 
GDP + ATP <-> GTP + ADP 



RxnName 

HK1 

G6PC 

GPI 

PFKL 

FBP1 

ALDOA 

TPI1 

GAPD 

PGK1 

PGAM1 

PGAM2 

PGAM3 

EN01 

PKLR 

PDHA1 

LDHA 

PGM1 

CS 

AC01 

AC02 

IDH1 

IDH2 

IDH3A 

OGDH 

SUCLG1 

SUCLA2 

FH 

MDH1 

MDH2 

PC 

ACLY 

PCK1 

G5PD 

PGLS 

PGD 

RPE 

TKT1 

TKT2 

TALD01 

UGP1 

ACACA 

ME3 

MTND1 

SDHC1 

SDHC2 

UQCRFS1 

COX5BL4 

MTAT 

PP 

GOT1 
GOT2 
GOT3 
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Glycogen 
GBE1 

GYS1.GYS 



GLYCOGEN + PI -> G1P 
UDPG -> UDP + GLYCOGEN 



Glycerol Phosphate Shunttie 



GPD2 
GPD1 
RPIA, RPI 

Mitochondria Transport 



GL3Pm + FADm -> T3P2m + FADH2m 
T3P2 + NADH -> GL3P + NAD 
RL5P <-> R5P 

CTT + MALm c-> CfTrn + MAL 
GL3P <-> GL3Pm 
T3P2 <-> T3P2m 
PYR <-> PYRm + Hm 

ADP + ATPm + PI + H -> Hm + ADPm + ATP + Ptm 

AKG + MALm <-> AKGm + MAL 

ASPm + GLU + H -> Hm + GLUm + ASP 

GDP + GTPm + PI + H •> Hm + GDPm + GTP + Pirn 



GBE1 
GYS1 

GPD2 
GPD1 
RPIA 

C1TMC 

GL3PMC 

T3P2MC 

PYRMC 

ATPMC 

MALMC 

ASPMC 

GTPMC 
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TABLE 4 

Metabolic Reaction for Muscle Cells 
Reaction 

GLC + ATP ■> G6P + ADP 

G5P<->F6P 

F6P + ATP -> FDP + ADP 

FDP + H20->F6P + PI 

FOP<->T3P2 + T3P1 

T3P20T3P1 

T3P1 + p| + NAD <-> NADH + 13PDG 
13PDG + ADP <•> 3PG + ATP 
3PG <->2PG 
2PG c-> PEP + H20 

PEP + ADP -> PYR + ATP ^ a 

PYRm + COAm + NADm -> + NADHm + C02m + ACCOAm 

NAD + LAC <-> PYR + NADH 

G1P<->G6P 

ACCOAm + OAm + H20m -> COAm + CTTm 

crr<->icrr 

CTTm ICITm 

yCXT + NADP -> NADPH + C02 + AKG 

ICITm + NADPm ■> NADPHm + C02m + AKGm 

ICITm + NADm -> C02m + NADHm + AKGm 

AKGm ♦ NADm + COAm ->C02m + NADHm + SUCO)Am 

GTPm + SUCCm + COAm <-> GDPm + Plm + SUCCOAm 

ATPm + SUCCm * COAm <■> ADPm + Plm + SUCCOAm 

FUMm + H20m <-> MALm 

MAL + NAD o NADH + OA 

MALm + NADm <-> NADHm + OAm 

PYRm + ATPm + C02m -> ADPm + OAm + Plm 

ATP + CTT + COA + H20 -> ADP + PI + ACCOA + OA 

OA + GTP-> PEP + GDP + C02 

OAm + GTPm -> PEPm + GDPm + C02m 

G5P + NADP <-> D6PGL + NADPH 

DBPGL + H20 -> D6PGC 

D5PGC + NADP -> NADPH + C02 + RLSP 

RL5P<u>X5P 

R5P + X5P <^> T3P1 + S7P 

X5P + E4P <-> F6P + T3P1 

T3P1 + S7P <-> E4P + F6P 

RL5P<->R5P 

G1P + UTP->UDPG + PPI 

GLYCOGEN + PI -> G1P 

UDPG -> UDP + GLYCOGEN 

MALm + NADm -> C02m + NADHm + PYRm 

MALm + NADPm -> C02m + NADPHm + PYRm 

MAL + NADP ->C02 + NADPH + PYR 

NADHm + Qm + 4 Hm -> QH2m + NADm + 4 H 

SUCCm + FADm o FUMm + FADH2m 

FADH2m + Cto<^FADm + QH2m 

02m + 4FEROm + 4Hm->4FERlm + 2H20m + 4H 

QH2m + 2 FERIm + 4 Hm -> Qm + 2 FEROm + 4 H 

ADPm + Plm + 3 H -> ATPm + 3 Hm + H20m 

ADP + ATPm + PI + H -> Hm + ADPm + ATP + Plm 

GDP + GTPm + PI + H *> Hm + GDPm + GTP + Plm 

PPI->2Pl 

GDP + ATP<->GTP + ADP 

ACCOA + ATP + C02 <w> MALCO A + ADP + PI + H 

MALCOA + ACP MALACP + COA 

ACCOA + ACP <^> ACACP + COA 



RxtName 
0HK1 
0GP1 
0PFKL1 
0FBP1 
0ALDOA 
0TPI1 
OGAPD 
0PGK1 
0PGAM3 
0ENO1 
0PK1 
0PDHA1 
OLDHA 
0PGM1 
OCS 
0ACO1 
0ACO2 
0IDH1 
01DH2 
0IDH3A 
OOGDH 
0SUCLG1 
0SUCLA2 
OFH 
0MDH1 
0MDH2 
OPC 
OACLY 
0PCK1 
0PCK2 
0G6PD 
0H6PD 
OPGD 
ORPE 
0TKT1 
0TKT2 
0TALDO1 
ORPIA 
0UGP1 
0GBE1 
0GYS1 
0ME2 
0ME3 

0 HUMNDME 

0MTND1 

0 SDHC1 

0SDHC2 

0UQCRFS1 

0 COX5BL4 

0MTAT1 

OATPMC 

OGTPMC 

OPP 

0NME1 

OACACA 

0FAS1 1 

0 FAS1_2 



ACACP + 4 MALACP + 8 NADPH -> 8 NADP + C100ACF • + 4 ^ + 4 ACP 
ACACP + 5 MALACP + 10 NADPH •> 10 NADP * C12QACP + 5 C02 + 5 

ACACP + 6I^CP + 12NADPH^12NADP + C140ACP + 6CO2 + 6 
ACACP + 6 MALACP + 11 NADPH •> 11 NADP + C141ACP + 6 C02 + 6 
ACACP + 7 MALACP + 14 NADPH -> 14 NADP + C160ACP + 7 C02 + 7 
ACACP + 7 MALACP + 13 NADPH ■> 13 NADP + C161ACP + 7 C02 + 7 
ACACP + 8^C^ + 16NADPH^16^DP + C180ACP + 8CO2 + 8 

ACACP + 8 MALACP + 15 NADPH -> 15 NADP + C181ACP + 8 C02 + 8 
ACACP + 8 MALACP + 14 NADPH ^ 14 NADP + C1B2ACP + 8 C02 + 8 
ACP 

C160ACP + H20 -> C160 + ACP 

C160 + COA + ATP -> AMP + PPI + C1BOCOA 



0C100SY 

0C120SY 

0 C140SY 

0C141SY 

0C160SY 

0 C161SY 

0C180SY 

0 C181SY 

0C182SY 
0 PPT! 
0KIAA 
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C160COA + CAR C160CAR + CQA 

C160CARm + COAm -> C160COAm + CARm 

C160CARm + COAm + FADm + NADm -> FADH2m + NADHm + 

C140COAm + ACCOAm 

C140CQAm + 7 COAm + 7 FADm + 7 NADm -> 7 FADH2m + 7 NADHm + 7 
ACCOAm 

TAGLYm + 3 H20m -> GLm + 3 C160m 

GL3P + 0.017 C10QACP + 0.062 C120ACP 4- 0.1 C14QACP + 0.27 
C15QACP + 0.169 C161ACP + 0.055 C18QACP + 0.235 C181ACP + 0.093 
C182ACP •> AGL3P + ACP 

AGL3P + 0.017 C100ACP + 0J062 C12QACP + 0.100 C140ACP + 0.270 

C160ACP + 0.169 C161ACP + 0.055 C180ACP + 0.235 C181ACP + 0.093 

C182ACP -> PA + ACP 

ATP + CHO->ADP + PCHO 

PCHO + CTP -> CDPCHO + PPI 

COPCHO + DAGLY -> PC + CMP 

SAM + PE -> SAH + PMME 

SAM + PMME ■> SAH + PDME 

PDME+ SAM •> PC + SAH 

GSP->MI1P 

MI1P->MYOI + PI 

PA + CTP <-> CDPD G + PPI 

COPDG + MYOl -> CMP + PINS 

ATP + PINS -> ADP + PINSP 

ATP + PINS -> ADP *• PINS4P 

PINS4P + ATP -> D45PI + ADP 

D45PI->TPI + DAGLY 

PA+H20->DAGLY+Pl 

DAGLY + 0.017 C10DACP + 0.062 C12QACP + 0.100 C140ACP + 0.270 

C160ACP + 0.169 C161ACP + 0.055 C18QACP + 0.235 C181ACP + 0.093 

C182ACP -> TAGLY + ACP • 

COPDG + SER <■> CMP + PS 

CDPETN + DAGLY <-> CMP + PE 

PE + SER o PS + ETHM 

ATP + ETHM -> ADP + PETHM 

PETflM + CTP -> CDPETN + PPI 

PS->PE + C02 

3HBm + NADm -> NADHm + Hm + ACTACm 

ACTACm ♦ SUCCOAm ■> SUCCm + AACOAm 

THF + SER <-> GLY ♦ METTHF 

THFm * SERm <■> QJYm ♦ METTHFm 

SS&n + PYRm <-> ALAm + 3HPm 

3PG + NAD <-> NADH + PHP 

PHP + GLU<->AKG + 3PSER 

3PSER + H20 PI + SER 

3HPm + NADHm •> NADm ♦ GLYAm 

SER->PYR + NH3 + H20 

GLYAm + ATPm •> ADPm + 2PGm 

PYR + QJU <■> AKG + ALA 

GLUm + C02m + 2 ATPm -> 2 ADPm + 2 Plm + CAPm 
AKGm + NADHm + NH3m <-> NADm + H20m + GUJm 
AKGm ♦ NADPHm + NK3m <-> NADPm + H20m + GLUm 
GLUm + NH3m + ATPm ■> GLNm ♦ ADPm + Plm 
ASPm + ATPm + GLNm -> GLUm + ASNm + AMPm + PPim 
ORN + AKG <-> GLUGSAL + GLU 
GLU <-> GLUm + Hm 

GLU + ATP + NADPH -> NADP + ADP + PI + GLUGSAL 

GLUP + NADH -> NAD + PI + GLUGSAL 

P5C <^> GLUGSAL 

HJS->NH3 + URO 

URO + H20 -> 415P 

415P + H20->RGU1 

BGAJJ + THF -> NFTHF ♦ GLU 

MET + ATP + H2D -> PPI + PI + SAM 

SAM ♦ DNA •> SAH ♦ DNA5MC 

SAH + H20-> HCYS + ADN 

HCYS ♦ MTHF -> THF «■ MET 

SER + HCYS -> LLCT ♦ H2D 

LLCT + H20 -> CYS + HSER 

OBUT + NH3<->HSER 

CYS + 02 <-> CYSS 

CYSS + AKG <-> GLU + SPYR 

SPYR ♦ H20 ^ H2SC3 + PYR 

LYS + NADPH* AKG -> NADP + H20 + SAC 

SAC + H20 ♦ NAD -> GLU + NADH + AASA 

AASA + NAD -> NADH ♦ AADP 

AADP + AKG -> GLU + KADP 

TF8 3 ♦ 02 -> FKYN 

FKYN 4- H20-> FOR + KYN 

KYN + NADPH + 02 -> HKYN + NADP + H20 

KKYN ♦ H2D-> HAN + ALA 



OC160CA 
0C160CB 

0HADHA 

0HADH2 
0TAGRXN 



0GAT1 



0 AGPAT1 

0CHKL1 

0PCYT1A 

0LDC 

OPBOIT 

0MFPS 

0PNMNM 

0ISYNA1 

0IMPA1 

0CDS1 

OPtS 

0PIK3CA 

0 P0C4CA 

0PIP5K1 

0 PLCB2 

0PPAP2A 



0DGAT 

0 PTDS 

0CEPT1 

0PESER 

0EKI1 

0 PCYT2 

0PISD 

0BDH 

03OCT 

0SHMT1 

0SHMT2 

0AGXT 

0PHGDH 

0PSA 

0PSPH 

OGLYD 

OSDS 

OGLTK 

OGPT 

0CPS1 

0GLUD1 

0GLUD2 

0 GLUL 

0 ASKS 

OOAT 

0 GLUMT 

0P5CS 

OPYCS 

OSPTC 

0 HAL 

OUROH 

OIMPR 

0 FTCD 

0MAT1A 

0DNMT1 

0AHCYL1 

0 MTR 

OCBS 

0 CTH1 

0CTH2 

0COO1 

OCYSAT 

0 SPTB 

0LKR1 

0LKR2 

02ASD 

0LOC5 

0TDO2 

OKYNF 

OKMO 

OKYNU2 
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HAN + 02 -> CMUSA 
CMUSA •> C02 + AM6SA 
AM6SA->PIC 

AM6SA + NAD -> AMUCO + NADH 

AMUCO + NADPH -> KADP + NADP + NH4 

ARG -> ORN + UREA 

ORN + Hm •> ORNm 

ORN + Hm + CTfRm <-> CfTR + ORNm 

ORNm + CAPm •> CITRm + Pim + Hm 

CTTR * ASP + ATP <-> AMP + PP1 + ARGSUCC 

ARGSUCC->FUM + ARG 

PRO + FAD -> P5C + FADH2 

P6C + NADPH -> PRO + NADP 

THR->NH3+H20 + OBUT 

THR + NAD -> C02 + NADH + AMA 

AMA + H20 + FAD -> NH3 + FADH2 + MTHGXL 

GLYm + THFm + NADm o METTHFm + NADHm + C02m + NH3m 

PHE + THBP + 02->TYR+ DHBP + H20 

NADPH + DHBP -> NADP + THBP 

AKG + TYR -> HPHPYR+ GLU 

HPHPYR + 02 -> HGTS + C02" 

HGTS + 02 -> MACA 

MACA •> FACA 

FACA + H20 -> FUM + ACA 

AKG + HE •> OMVAL + GLU 

OMVALm + COAm + NADm «■> MBCOAm + NADHm + C02m 

MBCOAm + FADm -> MCCOAm + FADH2m 

MCCOAm + H20m -> MHVCOAm 

MHVCOAm + NADm -> MAACOAm + NADHm 

MAACOAm -> ACCOAm + PROPCOAm 

2 ACCQAm <-> COAm + AACCOAm 

AKG + VAL •> OIVAL + GLU 

OIVALm + COAm + NADm -> IBCOAm + NADHm + C02m 
©COAm + FADm -> MACOAm + FADHZm 
MACOAm + HZOm ■> HIBCQAm 
HIBCOAm + H20m -> HIBm + COAm 
HIBm + NADm -> MMAm + NADHm 

MMAm + COAm + NADm -> NADHm + C02m + PROPCOAm 
PROPCOAm + C02m + ATPm -> ADPm + Plm + DMMCOAm 
DMMCOAm •> LMMCQAm 
LMMCQAm -> SUCCOAm 
AKG + LEU -> OICAP + GLU 

OlCAPm + COAm + NADm •> IVCOAm + NADHm + C02m 
OICAPm + COAm + NADH -> IVCOAm ♦ NADHm + C02m 
OlCAPm + COAm + NADHm •> IVCOAm + NADHm + C02m 
IVCOAm + FADm ■> MCRCOAm + FADH2m 
MCRCOAm + ATPm + C02m + H20m -> MGCOAm + ADPm + Pirn 
MGCOAm + H20m -> H3MCOAm 
H3MCOAm -> ACCQAm + ACTACm 
MYOACT + ATP -> MYOATP + ACTIN 

MYOATP + ACTIN -> MYOADP AC 

MYOADPAC -> ADP + P1 + MYOACT + CONTRACT 

PCRE+ ADP -> CRE + ATP 

AMP + H20->Pi + ADN 

ATP*AMP<->2ADP 

Q2<-> 02m 

3HB -> 3HBm 

CTT + MALrn <-> CTTm + MAL 

PYR <-> PYRm + Hm 

C160CAR + COAm -> C160COAm + CAR 

OMVAL ■> OMVALm 

OIVAL -> OIVALm 

OICAP -> OlCAPm 

GL<-> GLm 

GL3Pm + FADm •> T3P2m + FADH2m 

T3P2 ♦ NADH <-> GL3P + NAD 

GL3P<->GL3Pm 

T3P2 «=-> T3P2m 

OAm ■•"GLJUm «■> ASPm + AKGm 

OA + GLU <-> ASP + AKG 

AKG + MALm <^> AKGm + MAL 

ASPm + GLU + H -> Hm + GLUm + ASP 

GLCxt->GLC 

02x1 ->02 

C150Axt + FABP ■> C160FP + ALBxt 

C160FP -> C160 + FABP 

CIBOAxt + FABP -> C180FP + ALBxt 

C180FP->C180 + FABP 

C161Axt + FABP -> C161FP + ALB* 

C161FP->C161 + FABP 

C1B1AX1 + FABP •> C181FP + ALBxt 



OHAAO 

OACSD 

OSPTA 

OAMSD 

0 2AMR 

0ARG2 • 

OORNMT 

ooRNcrrr 
o arc 

OASS 

OASL 

OPRODH 

0PYCR1 

OWTDH 

OTDH 

OMAOA 

0 AMT 

OPAH 

OQDPR 

OTAT 

OHPD 

OHGD 

0 GSTZ1 

OFAH 

0BCAT1A 

OBCKDHAA 

OACADMA 

0ECHS1B 

0 EHHADHA 

0 ACAA2 

OACATml 

0BCAT1B 

OBCKDHAB 

OACADSB 

O&IHADHC 

OHIBCHA 

0 EHHADHB 

OMMSDH 

OPCCA 

OHIBCHF 

OMUT 

0BCAT1C 

OBCKDHAC 

0 BCKDHBC 

ODBTC 

OIVD 

0MCCC1 

0H1BCHB 

OHMGCL 

OMYOSA 

OMYOSB 

OMYOSC 

OCREATA 

OCREATB 

OCREATC 

002MT 

OHBMT 

OCTTMC 

OPYRMC 

0C160CM 

OHIBCHC 

0 HIBCHD 

0 HIBCHE 

OGLMT 

0GPD2 

0GPD1 

0GL3PMC 

0T3P2MC 

0GOT1 

0GOT2 

OMALMC 

OASPMC 

0GLUT4 

0O2UP 

0FAT1 

0FAT2 

0FAT3 

0FAT4 

0FAT5 

0FAT6 

0FAT7 
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C1B1FP -> C181 + FABP 

C1B2Axt + FABP -> C1B2FP + ALBxt 

C182FP C1B2 + FABP 

C204Axt + FABP -> C2D4FP ♦ ALBxt 

C204FP->C204 + FABP 

PYRxt + HEXT <-> PYR ♦ H 

LACM + HEXT <-> LAC + HEXT 

H «-> HEXT 

C02<->C02m 

H20coH20m 

ATP + AG + CQA -> AMP + PPl + ACCQA 

C160CAR <-> C160CARm 

CARm<^>CAR 

C02xt<->C02 

H2Oxt<->H20 

Pbct + HEXT <-> HEXT + PI 

<-> GLCxt 

<-> PYRxt 

<-> C02act 

<-> 02rt 

<-> PIxt 

<-> H20xt 

<-> LACact 

<-> C16QAact 

<-> cieiAxt 

<-> ClBQAact 
<-> C181Axt 
<-> C182Axt 
<-> C204Axt 
<-> ALBxt 
<-> 3HB 
« GLYCOGEN 
<->PCRE 
<~> TAGLYm 
<->lLE 
<->VAL 
<^>CRE 
<-> ADN 
<^>PI 
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OFATB 
0FAT9 
0 FAT10 
0 FAT11 
0FAT12 
OPYRUP 
OLACUP 
OHextUP 
0CO2MT 
0 H2QMT 
0FU2 
. OC160MT ' 
OCARMT 
0CO2UP 
0H2OUP 
OPIUP 
OGLCexR 
0 PYRexR 
0CO2exR 
0O2exR 
OPtexR 
0 H20exR 
OLACexR 
0C16QAexR 
0Cl61AexR 
OC180AexR 
0 C181AexR 
0C182AaxR 
0C204AexR 
0 ALBexR 
OHBexR 
OGLYex 
OPCREex 
OTAGmex 
OlLEex 
OVALex 
OCREex 
DAONex 
OPtex 
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What is claimed is: 

1. A computer readable medium or media, 
comprising: 

(a) a data structure relating a plurality of 
5 Homo sapiens reactants to a plurality of Homo sapiens 
reactions, 

wherein each of said Homo sapiens reactions 
comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
10 reaction and a stoichiometric coefficient relating said 
substrate and said product, 

wherein at least one of said Homo sapiens 
reactions is annotated to indicate an associated gene; 

(b) a gene database comprising information 
15 characterizing said associated gene; 

(c) a constraint set for said plurality of 

Homo sapiens reactions, and 

(d) commands for determining at least one 
flux distribution that minimizes or maximizes an 

.20 objective function when said constraint set is applied 
to said data representation, wherein said at least one 
flux distribution is predictive of a Homo sapiens 
physiological function. 

2. The computer readable medium or media of 
25 claim 1, wherein said plurality of Homo sapiens 
reactions comprises at least one reaction from a 
peripheral metabolic pathway. 



3. The computer readable medium or media of 
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claim 2, wherein said peripheral metabolic pathway is 
selected from the group consisting of amino acid 
biosynthesis, amino acid degradation, purine 
biosynthesis, pyrimidine biosynthesis, lipid 
5 biosynthesis, fatty acid metabolism, cof actor 
biosynthesis and transport processes. 

4 . The computer readable medium or media of 
claim 1, wherein said Homo sapiens physiological 
function is selected from the group consisting of 

10 growth, energy production, redox equivalent production, 
biomass production r production of biomass precursors, 
production of a protein, production of an amino acid, 
production of a purine, production of a pyrimidine, 
production of a lipid, production, of a fatty acid, 

15 production of a cofactor, transport of a metabolite, 

and consumption of carbon, nitrogen, sulfur, phosphate, 
hydrogen or oxygen. 

5. The computer readable medium or media of 
claim 1, wherein said Homo sapiens physiological 

20 function is selected from the group consisting of 

degradation of a protein, degradation of an amino acid, 
degradation of a purine, degradation of a pyrimidine, 
degradation of a lipid, degradation of a fatty acid and 
degradation of a cofactor. 

25 6. The computer readable medium or media of 

claim 1, wherein said data structure comprises a set of 
linear algebraic equations. 

7. The computer readable medium or media of 
claim 1, wherein said data structure comprises a 
30 matrix. 
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The computer readable medium or media of 



claim 1, wherein said commands comprise an optimization 
problem. 

9. The computer readable medium or media of 
5 claim 1, wherein said commands comprise a linear 

program. 

10. The computer readable medium or media of 
claim 1, wherein at least one reactant in said 
plurality of Homo sapiens reactants or at least one 

10 reaction in said plurality of Homo sapiens reactions is 
annotated with an assignment to 'a subsystem or 
compartment . 

11. The computer readable medium or media of 
claim 10, wherein a first substrate or product in said 

15 plurality of Homo sapiens reactions is assigned to a 
first compartment and a second substrate or product in 
said plurality of Homo sapiens reactions is assigned to 
a second compartment. 



20 



12. The computer readable medium or media of 
claim 1, wherein a plurality of said Homo sapiens 
reactions is annotated. to indicate a plurality of 
associated genes and wherein said gene database 
comprises information characterizing said plurality of 
25 associated genes. 

13. A computer readable medium or media, 
comprising: 

(a) a data structure relating a plurality of 
Homo sapiens reactants to a plurality of Homo sapiens 
30 reactions, 
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wherein each of said Homo sapiens reactions 
comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating said 
5 substrate and said product, 

wherein at least one of said Homo sapiens 
reactions is a regulated reaction; 

(b) a constraint set for said plurality of 
Homo sapiens reactions, wherein said constraint set 

10 includes a variable constraint for said regulated 
reaction, and 

(c) commands for determining at least one 
flux distribution that minimizes or maximizes an 
objective function when said constraint set is applied 

15 to said data representation, wherein said at least one 
flux distribution is predictive of a Homo sapiens 
physiological function. 

14. The computer readable medium or media of 
claim 13, wherein said variable constraint is dependent 
20 upon the outcome of at least one reaction in said data 
structure. 

•15. The computer readable medium or media of 
claim 13, wherein said variable constraint is dependent 
upon the outcome of a regulatory event. 

25 16. The computer readable medium or media of 

claim 13, wherein said variable constraint is dependent 
upon time. 



17. The computer readable medium or media of 
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claim 13, wherein said variable constraint is dependent 
upon the presence of a biochemical reaction network 
participant. 

18. The computer readable medium or media of 
claim 17, wherein said participant is selected from the 
group consisting of a substrate, product, reaction, 
protein, macromolecule, enzyme and gene. 

19. The computer readable medium or media of 
claim 13/ wherein a plurality of said reactions are 
regulated reactions and said constraints for said 
regulated reactions comprise variable constraints. 

20. A computer readable medium or media, 

15 comprising: 

(a) a data structure relating a plurality of 

Homo sapiens skeletal muscle cell reactants to a 

plurality of Homo sapiens skeletal muscle cell 

reactions, wherein each of said Homo sapiens 

20 reactions comprises a reactant identified as a 

substrate of the reaction, a reactant identified as a 

product of the reaction and a stoichiometric 

coefficient relating said substrate and said product; 

(b) a constraint set for said plurality of 

25 Homo sapiens reactions, and 

(c) commands for determining at least one 
flux distribution that minimizes or maximizes an 
objective function when said constraint set is applied 
to said data representation, wherein said at least one 

30 flux distribution is predictive of Homo sapiens 
skeletal muscle cell energy production. 



21. A method for predicting a Homo sapiens 
physiological' function, comprising: 
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(a) providing a data structure relating a 
plurality of Homo sapiens reactants to a plurality of 
Homo sapiens reactions, 

wherein each of said Homo sapiens reactions 
5 comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating said 
substrate and said product, 

wherein at least one of said Homo sapiens 
10 reactions is annotated* to indicate an associated gene; 

(b) providing a constraint set for said 
plurality of Homo sapiens reactions; 

(c) providing an objective function, and 

(d) determining at least one flux 

15 distribution that minimizes or maximizes said objective 
function when said constraint set is applied to said 
data structure, thereby predicting a Homo sapiens . 
physiological function related to said gene. 

22. The method of -claim 21, wherein said 

20 plurality of Homo sapiens reactions comprises at least 
one reaction from a peripheral metabolic pathway. 

23. The method of claim 22, wherein said 
peripheral metabolic pathway is selected from the group 
consisting of amino acid biosynthesis, amino acid 

25 degradation, purine biosynthesis, pyrimidine 
biosynthesis, lipid biosynthesis, fatty acid 
metabolism, cofactor biosynthesis and transport 
processes - 



24. The method of claim 21, wherein said 
30 Homo sapiens physiological function is selected from 
the group consisting of growth, energy production, 
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redox equivalent production, biomass production, 
production of biomass precursors, production of a 
protein, production of an amino acid, production of a 
purine, production of a pyrimidine, production of a 
5 lipid, production of a fatty acid, production of a 

cofactor, transport of a metabolite, and consumption of 
carbon, nitrogen, sulfur, phosphate, hydrogen or 
oxygen . 

25. The method of claim 21, wherein said 
10 Homo sapiens physiological function is selected from 
the group consisting of glycolysis, the TCA cycle, 
pentose phosphate pathway, respiration, biosynthesis of 
an amino acid, degradation of an amino acid, 
biosynthesis of a purine, biosynthesis of a pyrimidine, 
15 biosynthesis of a lipid, metabolism of a fatty acid, 
biosynthesis of a cofactor, transport of a metabolite 
and metabolism of a carbon source, nitrogen source, 
oxygen source, phosphate source, hydrogen source or 
sulfur source. 



20 



25 



26. The method of claim 21, wherein said, 
data 

structure comprises a set of linear algebraic 
equations . 

27. The method of claim 21, wherein said 
data 

structure comprises a matrix. 

28. The method of claim 21, wherein said 
flux 

distribution is determined by linear programming. 



30 



WO 03/082214 PCT/US03/09751 

111 

29. The method of claim 21, further 
comprising: 

(e) providing a modified data structure , 
wherein said modified data structure comprises at least 

5 one added reaction, compared to the data structure of 
part (a) f and 

(f) determining at least one flux 
distribution 

that minimizes or maximizes said objective function 
10 when said constraint set is applied to said modified 
data structure, thereby predicting a Homo sapiens 
physiological function . 

30. The method of claim 29, further 
comprising 

15 identifying at least one participant in said at least 
one added reaction. 

31. The method of claim 30, wherein said 
identifying at least one participant comprises 
associating a Homo sapiens protein with said at least 

20 one reaction. 

32. The method of claim 31, further 
comprising 

identifying at least one gene that encodes said 
protein. 

25 33. The method of claim 30, further 

comprising 

identifying at least one compound that alters the 
activity or amount of said at least one participant, 
thereby identifying a candidate drug or agent that 
30 alters a Homo sapiens physiological function. 
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34. The method of claim 21, further 

comprising: 

•(e) providing a modified data structure, 
wherein said modified data structure lacks at least one 
5 reaction compared to the data structure of part (a) , 
and 

(f) determining at least one flux 
distribution 

that minimizes or maximizes said objective function 
10 when said constraint set is applied to said modified 
data structure, thereby predicting a Homo sapiens 
physiological function. 

35. The method of claim 34, further 
comprising 

15 identifying at least one participant in said at least 
one reaction. 

36. The method of claim 35, wherein said 
identifying at least one participant comprises 
associating a Homo sapiens protein with said at least 

20 one reaction. 

37. The method of claim 36, further 
comprising 

identifying at least one gene that encodes said protein 
that performs said at least one reaction. 



25 
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38. The method of claim 35, further 
comprising 

identifying at least one compound that alters the 
activity or amount of said at least one participant, 
thereby identifying a candidate drug or agent that 
alters a Homo sapiens physiological function. 
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39. The method of claim 21, further 
comprising: 

(e) providing a modified constraint set f 
wherein said modified constraint set comprises a 

5 changed constraint for at least one reaction compared 
to the constraint. for said at least one reaction in the 
data structure of part (a) , and 

(f) determining at least one flux 
distribution 

10 that minimizes or maximizes said objective function 
when said modified constraint set is applied to said 
data structure , thereby predicting a Homo sapiens 
physiological function. 

40. The method of claim 39, further 
15 comprising 

identifying at least one participant in said at least 
one reaction . 

41- The method of claim 40 f wherein said 
identifying at least one participant comprises 
20 associating a Homo sapiens protein with said at least 
one reaction. 

42. The method of claim 41, further 
comprising 

identifying at least one gene that encodes said 
25 protein. 

43. The method of claim 4 0, further 
comprising 

identifying at least one compound that alters the 
activity or amount of said at least one participant, 
30 thereby identifying a candidate drug or agent that 
alters a Homo sapiens physiological function. 
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44. The method of claim 21, further 
comprising 

providing a gene database relating one or more 
reactions in said data structure with one or more genes 
5 or proteins in Homo sapiens. 

45. A method for predicting a Homo sapiens 
physiological function, comprising: 

(a) providing a data structure relating a 
plurality of Homo sapiens reactants to a plurality of 
10 Homo sapiens reactions, 

wherein each of said Homo sapiens reactions 
comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating said 
15 substrate and said product, 

wherein at least one of said Homo sapiens 
reactions is a regulated reaction; 

(b) providing a constraint set for said 
plurality of Homo sapiens reactions, wherein said 

20 constraint set includes a variable constraint for said 

regulated reaction; 

(c) providing a condition-dependent value to 

said variable constraint; 

(d) providing an objective function, and 
25 (e) determining . at least one flux 

distribution that minimizes or maximizes said objective 
function when said constraint set is applied to said 
data structure, thereby predicting a Homo sapiens 
physiological function. 



30 



46. The method of claim 45, wherein said 
value 
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provided to said variable constraint changes in 
response to the outcome of at least one reaction in 
said data structure. 

47. The method of claim 45, wherein said 
5 value 

provided to said variable constraint changes in 
response to the outcome of a regulatory event. 

48. The method of claim 45, wherein said 
value 

10 provided to said variable constraint changes in 
response to time. 

49. The method of claim 45, wherein said 
value 

provided to said variable constraint changes in 
15 response to the presence of a biochemical reaction 
network participant. 

50. The method of claim 49, wherein said 
participant is selected from the group consisting of a 
substrate, product, reaction, enzyme, protein, 

20 macromolecule and gene. 

51. The method of claim 45, wherein a 
plurality of said reactions are regulated reactions and 
said constraints for said regulated reactions comprise 
variable constraints. 

25 52. A method for predicting Homo sapiens 

growth, comprising: 

(a) providing a data structure relating a 
plurality of Homo sapiens skeletal muscle cell 
reactants to a plurality of Homo sapiens skeletal 

30 muscle cell reactions, 
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wherein each of said Homo sapiens reactions 
comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating said 
' 5 substrate and said product; 

(b) providing a constraint set for said 
plurality of Homo sapiens reactions; 

(c) providing an objective function, and 

(d) determining at least one flux 

10 distribution that minimizes or maximizes said objective 
function when said constraint set is applied to said 
data structure, thereby predicting Homo sapiens 
skeletal muscle cell energy production. 

53. A method for making a data structure 
15 relating a plurality of Homo sapiens reactants to a 
plurality of Homo sapiens reactions in a computer 
readable medium or media, comprising: 

(a) identifying a plurality of Homo sapiens 
reactions and a plurality of Homo sapiens reactants 

20 that are substrates and products of said Homo sapiens 
reactions; 

(b) relating- said plurality of Homo sapiens 
reactants to said plurality of Homo sapiens reactions 
in a data structure, 

25 - wherein each of said Homo sapiens reactions 

comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
reaction and a stoichiometric coefficient relating said 
substrate and said product; 

30 ( C ) determining a constraint set for said 

plurality of Homo sapiens reactions; 

(d) providing an objective function; 

(e) determining at least one flux 
distribution that minimizes or maximizes said objective 
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function when said constraint set is applied to said 
data structure, and 

(f) if said at least one flux distribution 
is not predictive of a Homo sapiens physiological 
5 function, then adding a reaction to or deleting a 
reaction from said data structure and repeating step 
(e), 

if said at least one flux distribution is 
predictive of a Homo sapiens physiological function/ 
10 then storing said data structure in a computer readable 
medium or media. 



54. The method of claim 53, wherein a 
reaction 

in said data structure is identified from an annotated 
15 genome. 

55. The method of claim 54 f further 
comprising 

storing said reaction that is identified from an 
20 annotated genome in a gene database. 

56. The method of claim 53, further 
comprising 

annotating a reaction in said data structure. 

25 57. The method of claim 56, wherein said 

annotation is selected from the group consisting of 
assignment of a gene, assignment of a protein, 
assignment of a subsystem, assignment of a confidence 
rating, reference to genome annotation information and 

30 reference to a publication. 



58. The method of claim 53, wherein step (b) 



10 
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further comprises identifying an unbalanced reaction in 
said data structure and adding a reaction to said data 
structure, thereby changing said unbalanced reaction to 
a balanced reaction. 

59. The method of claim 53, wherein said 
adding a reaction comprises adding a reaction selected 
from the group consisting of an intra-system reaction, 
an exchange reaction, a reaction from a peripheral 
metabolic pathway, reaction from a central metabolic 
pathway, a gene associated reaction and a non-gene 
associated reaction. 

60. The method of claim 59, wherein said 
peripheral metabolic pathway is selected from the group 
consisting of amino acid biosynthesis, " amino acid 
15 degradation^ purine biosynthesis, pyrimidine 
biosynthesis, lipid biosynthesis, fatty acid 
metabolism, cofactor biosynthesis and transport 
processes. 

61. The method of claim 53, wherein said 
20 Homo sapiens physiological function is selected from 
the group consisting of growth, energy production, 
redox equivalent production, biomass production, 
production of biomass precursors, production of a 
protein, production of an amino acid, production of a 
25 purine, production of a pyrimidine, production of a 
lipid, production of a fatty acid, production of a 
cofactor, transport of a metabolite, development, 
intercellular signaling, and consumption of carbon 
nitrogen, sulfur, phosphate, hydrogen or oxygen. 

30 62 . The method of claim 53, wherein said 

• Homo sapiens physiological function is selected from 
the group consisting of degradation of a protein, 
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degradation of an amino acid, degradation of a purine , 
degradation of a pyrimidine, degradation of a lipid, 
degradation of a fatty acid and degradation of a 
cof actor. 

5 63. The method of claim 53, wherein said 

data 

structure comprises a set of linear algebraic 
equations. 

64. The method of claim 53, wherein said 
10 data 

structure comprises a matrix. 



65. The method of claim 53, wherein said 
flux 

distribution is determined by linear programming. 



15 



66. A data structure relating a plurality of 
Homo sapiens reactants to a plurality of Homo sapiens 
reactions, wherein said data structure is produced by a 
process comprising: 
20 (a) identifying a plurality of Homo sapiens 

reactions and a plurality of Homo sapiens reactants 
that are substrates and products of said Homo sapiens 
reactions; 

(b) relating said plurality of Homo sapiens 
25 reactants to said plurality of Homo sapiens reactions 
in a data structure, 

wherein each of said Homo sapiens reactions 
comprises a reactant identified as a substrate of the 
reaction, a reactant identified as a product of the 
30 reaction and a stoichiometric coefficient relating said 
substrate and said product; 
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(c) determining a constraint set for said 
plurality of Homo sapiens reactions; 

(d) providing an objective function; 

(e) determining at least one flux 

5 distribution that minimizes or maximizes said objective 
function when said constraint set is applied to said 
. data structure, and 

(f) if said at least one flux distribution 
is not predictive of Homo sapiens physiology, then 

10 adding a reaction to or deleting a reaction from said 
data structure and repeating step (e) , 

if said at least- one flux distribution is 
predictive of Homo sapiens physiology, then storing 
said data structure in a computer readable medium or 

15 media. 
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